Diagnosis and treatment of diseases caused by mutations in CD72

ABSTRACT

Mutants of the CD72 gene and their protein products are disclosed. Methods of diagnosing various diseases such as thyroid follicular carcinoma, renal cancer, endometrial adenocarcinoma, leukemia, ovarian cancer, lymphoma and lupus erythematosus by detecting particular mutations in human CD72 are also disclosed. Methods for supplying wild-type to cells which have lost normal CD72 function is additionally disclosed. CD72 associated diseases are additionally disclosed.

CROSS-REFERENCE TO RELATED APPLICATION

[0001] This application claims the benefit of U.S. provisional application Serial No. 60/325,946, filed Sep. 28, 2001.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention relates to diseases resulting from hyperproliferation or hyperactivity of cells. In human thyroid follicular carcinoma, leukemia, renal cancer, endometrial adenocarcinoma, ovarian cancer, chondrosarcoma, lymphoma and human lupus erythematosus, expression of mutated forms of CD72 is described.

[0004] 2. Background Art

[0005] CD72 is a type II membrane glycoprotein with its N-terminus on the cytoplasmic side of the membrane. (For review, see Parnes and Pan, Immunol. Rev. August 2000; 176:75-85, 2000). CD72 has been found to be expressed by a number of cell types including B cells, T cells, vascular smooth muscle cells and a number of tumor cells (Robinson et. al., Immunogenetics. 1997;45(3):195-200, Hammer R. D., et al., Am. J. Surg. Pathol. May 1996;20(5):613-26, Garand R., et al, Leuk. Res. August 1994;18(8):651-2, Gagro A., et al., Leuk. Lymphoma. April 1997; 25(3-4):301-11., Hishima T., et al., Am. J. Pathol. August 1994;145(2):268-75, Mechtersheimer G., et al., Pathol Res Pract. August 1990;186(4):427-38., Schwarting et al., Am. J. Hematol. November 1992;41(3):151-8.)

[0006] Four different alleles have been identified for CD72 in mice. Sequence analysis of CD72 between mouse strains revealed that the cytoplasmic domain of CD72 is highly conserved while the extracellular domain is highly polymorphic, especially at the membrane-distal portion of the protein (Robinson et al., supra).

[0007] CD72 was initially implicated in B cell regulation, suggesting a role in the immune system. Ligation of CD72 with antibody leads to B cell proliferation and the phosphorylation of the Src kinases Lyn and Blk (Venkataraman C et. al, Eur. J. Immunol. October 1998;28(10):3003-16.). These Src kinase family members are also phosphorylated during B cell activation through the B cell receptor. However, CD72 has been found to be expressed in many other tissues as well. This expression in other tissues suggests that CD72 may have roles unrelated to immune regulation.

[0008] CD72 has two immunoreceptor tyrosine-based inhibitory motifs (ITIMs). The first ITIM has been demonstrated to bind to SHP-1 phosphatase (Adachi T. et al., J. Immunol. May 15, 1998;160(10):4662-5). Presumably, SHP-1 can dephosphorylate phosphorylated tyrosines in B cell activation and other cell activation events (Adachi T, et. al., J. Immunol. Feb. 1, 2000;164(3):1223-9). The second ITIM has been shown to bind to Grb2, not SHP-1 (Wu Y., et., al., Curr. Biol. Sep. 10, 1998;8(18):1009-17). Grb2 binds to Sos, a guanine nucleotide exchange factor for Ras (Li N., et al., Nature May 6, 1993;363(6424):85-8., Egan S. E., et al., Nature May 6, 1993;363(6424):45-51). The Ras pathway is involved in cell proliferation and differentiation (for review, see Lodish et. al., Molecular Cell Biology, Freeman and Company 2000).

[0009] The Class IV semaphorin CD100 has been shown to be the receptor for CD72 (Kumanogoh A., et al., Immunity November 2000;13(5):621-31). Semaphorins have been implicated in neuron development, organogenesis, and cancer. CD100 expression has also been demonstrated in a number of tissues, including, neural tissues, kidney, lung, and heart (Hall K. T., et al., Proc. Natl. Acad. Sci. USA. Oct. 15, 1996;93(21):11780-5). Additionally, CD100 is the only semaphorin reported to be expressed in T cells (Shi W, et al., Immunity November 2000;13(5):633-42.). Soluble CD100 was also shown to be released during T cell activation and in an animal model for lupus (Delaire S, et al., J. Immunol. Apr. 1, 2001;166(7):4348-54., Wang X., et al., Blood. Jun. 1, 2001;97(11):3498-504).

[0010] Many diseases are either the result of the hyperpoliferation or hyperactivity of cells. Cancers result from the hyperpoliferation of otherwise normal cells. Several infectious diseases such as Epstein-Barr, HPV and CMV also cause the hyperproliferation of cells as a result of infection. Autoimmune diseases such as lupus erythematosus and allergies result in a hyperactivation of T cells or B cells.

[0011] Many cell surface molecules have been implicated in the regulation of cell activation and cell growth. Cells are subject to both positive cell regulation, such as activation or growth stimulation by hormones, and negative regulation, such as inhibition of cell growth by extracellular ligands. Negative regulation through cell surface molecules plays an important role in regulating cell activity and migration (see review by Ravetch and Lanier, Nature October 2000; 290: 84-89).

[0012] If cell growth and activation go unregulated, diseases such as cancer occur. Recently, more and more data support CD72 as a negative regulator of B cell activation. (Pan C., et.al., Immunity. October 1999;11(4):495-506, Adachi T., et al., J. Immunol. May 15, 1998;160(10):4662-5., Adachi T., et al, J. Immunol. Feb. 1, 2000;164(3):1223-9.). CD72 mutations, however, have never been implicated in any diseases characterized by the hyperactivation or hyperpoliferation of cells.

[0013] Identification of CD72 mutations found in various diseases would not only offer a strategy for prospectively identifying and treating at risk patients based on their genotype, it would also provide important insights into the molecular mechanisms of this genetic polymorphism.

SUMMARY OF THE INVENTION

[0014] The present invention is based on the observation that a CD72 gene containing mutations and encoding a corresponding mutant protein may be correlated with the presence of diseases such as cancer, systemic lupus erythematosus, allergy, autoimmune disease, Epstein-Barr virus infection, cytomegalovirus infection and papillomavirus infection.

[0015] According to the present invention, there is provided a method for the diagnosis of diseases associated with a CD72 mutation in an individual, comprising detecting a mutation in the human CD72 gene, its CD72 mRNA, or in the CD72 protein, wherein the presence of the mutation is indicative of the disease.

[0016] It is another object of the invention to provide antibodies that bind to mutant CD72 proteins.

[0017] It is another object of the invention to provide primers for detecting and amplifying a region of DNA which contains the CD72 mutations.

[0018] It is another object of the invention to provide probes for detecting a region of DNA which contains the CD72 mutations.

[0019] It is also an object of the present invention to provide a mutant CD72 gene or an expressed mutant CD72 protein for drug development, gene therapy and other uses to prevent or ameliorate the effects of or resulting from the mutant CD72 gene.

BRIEF DESCRIPTION OF THE DRAWINGS

[0020]FIG. 1 depicts the sources of cDNA libraries which were used to screen for CD72 mutations.

[0021]FIG. 2 depicts RT-PCR of CD72 mRNA isolated from NZM mice demonstrating that NZM mice only express the B2 form of CD72. mRNA from the spleens of NZM mice were purified using RNeasy Midi Kits (Qiagen Inc., Valencia, Calif.). A set of primers were designed with the following sequences: 5′ primer AGAGGCGCCCAGGGCTAT (SEQ. ID. NO:23); 3′ primer CCCTCCCCTGACCCATCTCTA (SEQ. ID. NO:24). First round of RT-PCR was conducted using Titan One Tube RT-PCR (Roche Diagnostics GmbH, Germany). A second round of PCR utilized PCR Core System Kits from Promega Co. (Madison, Wis.). Annealing temperature set at 60° C., Mg++ concentration was at 1.5 mM. The products were analyzed in 1% agarose gel. Lane 1 is the PCR product from NZM mice. Lane 2 is the Φ X 174 Hae III fragment with molecular sizes at 1353 bp, 1078 bp, 873 bp, 603 bp.

[0022]FIG. 3 depicts a Western blotting analysis of the tissue homogenates from endometrial carcinoma tissue (Lane 1) and normal surrounding tissue (Lane 2) from the same patient. The homogenates were prepared with lysis buffer containing SDS. The samples were subjected to 10% SDS-PAGE gel under reducing conditions. After electrophoresis, the proteins were transferred to nitrocellulose membrane. The membrane was blocked with 3% non-fat milk from Pierce Co (Rockford, Ill.). Polyclonal rabbit anti-CD72 (Santa Cruz Biotech) was used as the first antibody for blotting of 60 minutes at 25° C. The membranes were washed 5 times with phosphate buffered saline containing 0.05% Tween-20. HRP-conjugated goat anti-rabbit antibody (Pierce) was then added as the second antibody. The membranes were then washed 5 times with phosphate buffered saline containing 0.05% Tween-20. West femto chemiluminescence kits from Pierce Co. (Rockford, Ill.) were used to develop the bands as shown according to the directions on the kit. Protein loading for Lane 2 is 50% higher than Lane 1 as determined by BCA kit from Pierce Co. (Rockford, Ill.).

[0023]FIG. 4 depicts a Western blotting analysis of tissue homogenates from renal tumors (Lane A2, Lane A4, Lane B2, Lane B4 and Lane B6) and normal surrounding tissues (Lane A1, Lane A3, Lane B 1, Lane B3 and Lane B5) from the same patients. These tissue samples were prepared with lysis buffer containing NP-40. The samples were subjected to 10% SDS-PAGE gel under reducing conditions. After electrophoresis, the proteins were transferred to nitrocellulose membrane. The membrane was blocked with Superblock from Pierce Co. (Rockford, Ill.). Polyclonal rabbit anti-CD72 (Santa Cruz Biotech, Santa Cruz, Calif.) was used as the first antibody for blotting of 60 minutes at 25° C. The membranes were washed 5 times with phosphate buffered saline containing 0.05% Tween-20. HRP-conjugated goat anti-rabbit antibody (Pierce) was then added as the secondary antibody. The membranes were then washed 5 times with phosphate buffered saline containing 0.05% Tween-20. West femto chemiluminescence kits from Pierce Co. (Rockford, Ill.) were used to develop the bands as shown according to the directions on the kit.

[0024]FIG. 5 depicts a Western blotting analysis of CD72 in endometrial carcinoma tissues. Tissue homogenates from endometrial carcinoma tissues (Lane 2, Lane 4, and Lane 5) and normal surrounding tissues (Lane 1 and Lane 3) were prepared with lysis buffer containing NP-40. The Western blotting procedures of FIG. 4 were used to detect CD72.

[0025]FIG. 6 depicts a Western blotting analysis of cell lysates from ECC-1 cells (Lane 5), and tissue homogenates from ovarian tumor tissues (Lane 2 and Lane 3) and normal surrounding tissues (Lane 1 and Lane 4) from the same patients. These samples were prepared with lysis buffer containing NP-40. The samples were subjected to 10% SDS-PAGE gel under reducing conditions. After electrophoresis, the proteins were transferred to nitrocellulose membrane. The membrane were blocked with 3% non-fat milk. Polyclonal rabbit anti-CD72 (Santa Cruz Biotech, Santa Cruz, Calif.) was used as the first antibody for blotting of 60 minutes at 25° C. The membranes were washed 5 times with phosphate buffered saline containing 0.05% Tween-20. HRP-conjugated goat anti-rabbit antibody (Pierce Co., Rockford, Ill.) was then added as the secondary antibody. The membranes were then washed 5 times with phosphate buffered saline containing 0.05% Tween-20. West femto chemiluminescence kits from Pierce Co. (Rockford, Ill.) were used to develop the bands as shown according to the directions on the kit.

[0026]FIG. 7 depicts a PCR analysis of the CD72 gene of ECC-1 Cells and a tumor sample. Genomic DNA from the ECC-1 cell line (Lane 2 and Lane 4) and a tumor sample (Lane 3) were analyzed by PCR using two sets of primers; ITIM primers (SEQ. ID. NO.: 19 and 20) (Lane 2) and EXON89 primers (SEQ. ID. NO.: 25 and 26) (Lane 3 and Lane 4). The PCR products were subjected to 1% agarose gel electrophoresis. Lane 1 is the 100 bp PCR marker. The PCR product from a normal CD72 gene would have 729 nucleotides for the ITIM primers or 830 nucleotides for the EXON89 primers. If there is mutation in the primer regions, there will be no amplification. If there is deletion or insertion between the primers, the PCR product will have a shorter or larger size.

[0027]FIG. 8 depicts a Western blotting analysis of tissue homogenates from lymphomas (Lane 1-3) and endometrial carcinoma tissues (Lane 4 and Lane 5). Homogenates were prepared with lysis buffer containing SDS. Equal amounts of protein were loaded in each lane as determined by BCA protein kit from Pierce. The same procedures used to detect CD72 in FIG. 3 were employed. Lane 1 is a sample from tumor tissue of Hodgkin's lymphoma. Lane 2 is a sample of lymphoma. Lane 3 is a sample from a B cell lymphoma of large cell type. Lane 4 and Lane 5 are samples from endometrial carcinoma.

[0028]FIG. 9 depicts a Western blotting analysis of protein homogenates from spleen of NZM and C57BL6/J mice. Equal amounts of protein lysates were analyzed by Western blotting as in FIG. 3. Lane M contains homogenates from NZM mice. Lane C contains homogenates from C57BL/6J mice. Panel B represents a shorter exposure of a membrane with less efficiency of transfer than in Panel A.

[0029]FIG. 10 depicts a Western blotting analysis of protein homogenates from the spleen of NZM and C57BL6/J mice. Equal amounts of protein lysates from spleen of NZM (M) and C57BL6/J (C) were analyzed by Western blotting as in FIG. 3 except that the first antibody was a monoclonal antibody against CD72 from Santa Cruz Biotech Inc. (Santa Cruz, Calif.) and a HRP-conjugated goat anti-mouse antibody from Pierce Co. (Rockford, Ill.) was used as the secondary antibody. Lane M contains homogenates from NZM mice. Lane C contains homogenates from C57BL/6J mice.

[0030]FIG. 11 depicts the DNA sequencing profile of a patient with severe lupus. Blood was drawn from a patient with severe lupus. The patient has symptoms in brain, kidney and joints. Leukocytes were obtained by Ficoll-Hypaque centrifugation. Genomic DNA was prepared using the Qiagen DNeasy Kit according to manufacturer's instructions(Valencia, Calif.). ITIM primers (SEQ. ID. NOS.: 19 and 20) were used in the PCR amplification. The PCR products were run on 1% agarose gel for 1 hour. The DNA band was cut out from the gel and purified using Qiagen gel kit (Valencia, Calif.). Purified DNA was sent to University of Michigan Sequence Core for sequencing using ITIM forward primer (SEQ. ID. NO. 19) as the sequencing primer. The region that has mutations in the lupus patient is shown.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0031] Four alleles of the mouse CD72 sequences are available in Genbank. Analysis of the CD72 locus of house mouse reveals polymorphism in CD72 between common inbred mice strains. However, the cytoplasmic domain of mouse CD72 is highly conserved between these strains, particularly in the two ITIM domains. The cytoplasmic domain of CD72 is also highly conserved between humans and mice. The normal human CD72 gene sequence has been reported by the Human Genome Project (SEQ. ID. NO:1). When the normal human CD72 gene was analyzed, 9 exons were found to code for CD72 protein, which is similar to mouse CD72 gene.

[0032] Mutant CD72 sequences can be identified by screening sequences deposited and available in public databases such as GenBank or other private sequence databases. Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across the full-length sequence can be determined through sequence alignment using computer software programs such as BLAST, BLAST-2, ALIGN, DNAstar, and INHERIT which employ various algorithms to measure homology.

[0033] The BLAST approach, as detailed in Karlin et al., Proc. Nat'l. Acad. Sci. USA 90:5873-5787 (1993), and incorporated herein by reference, searched for matches between a query sequence and a database sequence. BLAST evaluated the statistical significance of any matches found, and reported only those matches that satisfy the user-selected threshold of significance. In this application, the threshold was set at 10 for nucleotide searches.

[0034] A search of Genbank revealed more than 100 sequences or expressed sequence tags (EST) labeled as human CD72. The majority of these sequences came from the Integrated Molecular Analysis of Genomes and their Expression (“IMAGE”) Consortium clones (Lennon, G. G., et., al., Genomics 33:151-152 1996). Forty of these clones were obtained from the IMAGE consortium, distributed by Incyte Genomics Inc. (St. Louis, Mo.). These clones had been partially sequenced and all were claimed to match the human CD72 gene.

[0035] The IMAGE cDNA clones obtained from Incyte Genomics are in tranformed E. coli. Bacteria containing the cDNA from these clones was streaked onto LB plates with ampicillin and incubated overnight at 37° C. A well-isolated clone was then inoculated into 10 ml of LB broth containing ampicillin and cultured in a shaker overnight at 37° C. Plasmids were prepared using Qiagen mini-prep Kits (Qiagen Inc., Valencia, Calif.) as indicated by the manufacturers directions. Purified plasmids were sent to University of Michigan DNA Sequencing Core for DNA sequencing by Sanger's method, which is well known in the art.

[0036] The DNA sequences were compared to the sequences present in Genbank. Almost half of the clones matched to the human testis specific kinase 1, not CD72. All the clones that contained the human CD72 gene showed highly conserved nucleotide sequences. Thirteen clones from different cDNA libraries have shown almost 100% conservation in nucleotide sequence. The cDNA libraries were prepared from 17 different human pools. (FIG. 1).

[0037] By alignment and comparison of the clone's DNA sequences with the normal CD72 cDNA sequence, a cDNA clone from thyroid follicular carcinoma tissue was found to have a mutation in the cytoplasmic domain of human CD72 (SEQ. ID. NO:5). This mutation is the result of a truncation in the cDNA. A new segment of cDNA has replaced the segment of human C72 cDNA coding for the first ITIM domain in human CD72 protein. Codons 1 to 189 are deleted from the normal CD72 cDNA. The wild-type CD72 cDNA sequence is depicted in SEQ. ID. NO:2. An insertion of 452 base pairs at this site results in the mutant CD72 cDNA sequence. (SEQ. ID. NO:5.) These mutations result in a mutant CD72 protein with the sequence of SEQ. ID. NO:6. Without the ITIM domain, this region of CD72 can not be phosphorylated and thus, can not recruit phosphatase SHP-1. Therefore, the mutated CD72 can not function properly as a negative regulator because the recruitment of phosphatase SHP-1 is responsible for dephosphorylation events in signal transduction.

[0038] A BLAST of the mutant cDNA sequence of SEQ. ID. NO:5 against Chromosome 9 (Genbank Accession No. AL357874) reveals a deletion between codons 843 and 1462 of the genomic CD72 sequence (SEQ. ID. NO:1) which includes Exon 1, resulting in the intronic sequence of codons 716 to 842 being transcribed into mRNA. The new sequence at the 5′ end of the mutated genomic sequence matches Chromosome 9 (AL357874) codons 54511 to 54950, while the rest of the sequence starts at codon 26973 of AL357874. This suggests a 27,537 base pair gap in the genomic sequence. This suggests an additional deletion or translocation of chromosomal DNA in the CD72 gene region, resulting in the mutated genomic CD72 sequence of SEQ. ID. NO:4.

[0039] In a cDNA clone from renal cancer, also obtained from the IMAGE consortium, an insertion of 225 bp was found which included part of the intron between Exon 3 and Exon 4, an intron which should have been spliced during mRNA maturation. (SEQ. ID. NO:7). This mutation results in the mutated CD72 protein sequence of SEQ. ID. NO:8. This insertion is in the cytoplasmic domain. As the cDNA in this clone is not long enough, it is uncertain whether the ITIM domains are intact. However, the insertion introduces a stop codon at codon 137 of SEQ. ID. NO:7, thus no mature protein can be translated.

[0040] Western blotting analysis of 5 renal cancer tumor samples revealed an increased expression of CD72 in one tumor sample when compared to the normal surrounding tissue. (FIG. 4A, Lane 2). In another tumor sample, a decreased expression of CD72 was observed (FIG. 4 A, Lane 4). In addition, a new 40 kd protein band was expressed or significantly increased, similar to the 40 kd protein band observed in endometrial adenocarcinoma and ovarian cancer described below (FIG. 4). Most likely, this 40 kd protein is a truncated CD72.

[0041] Several CD72 mutations have also been found in cDNA clones obtained from a pool of seven endometrial adenocarinoma cDNAs. The intron between Exon 4 and Exon 5 (codons 2986-3306 of SEQ. ID. NO:1) is not processed. This intron contains two stop codons (located at codons 237 and 243 of SEQ. ID. NO. 10) that are in-frame with the normal CD72 open reading frame. Thus a mature CD72 protein can not be translated. In addition, a deletion of 61 bp between codons 1213-1273 of the normal CD 72 cDNA (SEQ. ID. NO:2) was found. The deletion is not in the open reading frame. In addition, there is a point mutation (A to T) at codon 233. These mutations result in a mutated CD72 cDNA (SEQ. ID. NO:10). The mutated genomic sequence has the sequence of SEQ. ID. NO:9.

[0042] Western blotting analysis of an endometrial adenocarcinoma tumor revealed that normal CD72 protein was not expressed. In one tumor sample, no CD72 was expressed (FIG. 8 Lane 4). In another tumor sample, a lower molecular weight 40 kd protein was detected at a significantly higher amount.(FIG. 5, Lane 5). In addition, an endometrial adenocarcinoma cell line (ECC-1) also expressed a CD72 protein with a molecular weight of 40 kd, lower than the normal molecular weight of 45 kd. (FIG. 6, lane 5). In order to analyze the mutation in the ECC-1 cell line, primers were designed to amplify the first two exons (ITIM primers, SEQ. ID. NOs.: 19 and 20) and the last two exons (EXON89 primers; SEQ. ID. NOs.: 25 and 26) of the CD72 gene. (FIG. 7). PCR analysis using the ITIM primers (SEQ. ID. Nos.: 19 and 20) revealed a band at the predicted size. (FIG. 7, Lane 2). However, when the EXON89 primer was used, a product at the predicted size was not detected (FIG. 7, Lane 4) although the positive control gave a band at the predicted size (FIG. 7, Lane 3). This result has been achieved in numerous experiments. Thus, the EXON89 primer is useful in identifying mutations in the CD72 gene.

[0043] Similarly, Western Blotting analysis of two ovarian cancer tumor samples demonstrated the expression of a 40 kd protein while there is no expression of this protein in normal surrounding tissue from the same patient. (FIG. 6). Western Blotting analysis of tumors from B-Cell lymphoma and lymphoma reveal an absence of any CD 72 protein. (FIG. 8, Lanes 2 and 3).

[0044] A BLAST search of Genbank also revealed four additional mutant CD72 sequences from pre-B cell leukemia (acute lymphocytic leukemia). These sequences are from clones with Genbank Accession Nos. AF283777, BE243553, BM193456 and BE244747). Three of these sequences: AF283777 (SEQ. ID. NO.: 11), BM193456 (SEQ. ID. NO.: 13) and BE244747 (SEQ. ID. NO.: 14) have a complete 5′ sequence. These three clones have a deletion of codons 141-190 of the normal CD 72 cDNA (SEQ. ID. NO.: 2). This deletion is most likely the result of alternative splicing (the deleted fragment begins with the sequence GT.) The deletion of codons 141-190 results in a frame shifting mutation and the generation of two consecutive stop codons. Thus, no protein product is produced. The other clone, BE243553 (SEQ. ID. NO.: 12), only contains the 3′ sequence, thus one cannot determine if codons 141-190 have been deleted. However, this clone revealed a mutation in which Intron 8 was transcribed. In addition to the deletion of codons 141-190, BM 193456 also has a serial of 8 thymidines inserted after the start codon, resulting in a frame-shifting mutation. The predicted genomic sequence for BM 193456 is shown in SEQ. ID. NO.: 15.

[0045] A cDNA sequence from chondrosarcoma was analyzed. A mutant cDNA CD72 sequence was identified (SEQ. ID. NO.:17). Codons 1255-1323 of the normal CD72 cDNA were deleted, resulting in the sequence of SEQ. ID. NO.: 17. This deletion of 69 bp also occurs in the 3′ nontranslated region of the gene but at a different location than the deletion identified in the cDNA from endometrial adenocarcinoma as described above. The genomic sequence of the mutant chondrosarcoma CD 72 gene is shown in SEQ. ID. NO.: 18.

[0046] DNA encoding CD72 mutants can be obtained from any cDNA libraries prepared from tissue believed to possess the mutant gene and to express it at a detectable level. Accordingly, human CD72 mutants can be conveniently obtained from a cDNA library prepared from human tissue, such as described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). Mutant CD72 genes may also be obtained from a genomic library.

[0047] Libraries can be screened with probes (such as antibodies to a mutant CD72 sequence) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA or genomic library with the selected probe may be conducted using standard procedures, such as described in Sambrook et al., supra. An alternative means to isolate CD72 mutant genes is to use PCR methodology [Sambrook et al., supra; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)].

[0048] When screening a cDNA library, the oligonucleotide sequences selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library being screened. Methods of labeling are well known in the art, and include the use of radiolabels like .sup.32 P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high stringency, are provided in Sambrook et al., supra.

[0049] The polymorphic expression of CD72 in mouse strains not only results from the expression of different alleles in different mouse strains, but also results from the differential mRNA splicing in the same mouse strain. Allele b of CD72 in Balb/c, C57BL/6 has three differentially spliced mRNA forms. One form with the complete sequence is expressed on the cell surface (B1 form). Another form does not contain exon 3 and exon 4 nucleotides including the trans-membrane domain (coded by Exon 4). Thus this form is not expressed on the cell surface (B3 form). One other form does not contain exon 3, which encodes a portion of the cytoplasmic domain. This form still contains two ITIM domains and is expressed on the cell surface (B2 form).

[0050] The NZM mouse strain is an animal model for human lupus erythematosus. The mRNA expression of these CD72 forms was analyzed in this mouse strain. Spleen mRNA from NZM mice was purified using RNeasy Midi Kits (Qiagen Inc., Valencia, Calif.). Primers were designed with the following sequences: 5′ primer AGAGGCGCCCAGGGCTAT (SEQ. ID. NO:23); 3′ primer CCCTCCCCTGACCCATCTCTA (SEQ. ID. NO:24). A first cycle of RT-PCR was conducted using Titan One Tube RT-PCR (Roche Diagnostics GmbH, Germany). A second cycle of PCR was then conducted with the same set of primers used PCR Core System Kits from Promega Co. (Madison, Wis.). Three bands corresponded to the size of CD72 mRNA were predicted for CD72 allele b (1.32 for full length DNA, 1.25 for exon 3 deleted DNA, 1.16 kb for exon 3 and exon 4 deleted DNA). In agarose gel analysis, unexpectedly, only one band was observed for NZM mice (FIG. 2). Thus, only one form of mRNA is expressed. This DNA fragment was separated by agarose gel and excised from the gel, centrifuged through a GenElute PCR DNA purification column (Sigma Co., St. Louis, Mo.), and the eluate was purified using GenElute PCR DNA purification kit. The purified fragment was ligated to a pGEM-T Easy vector from Promega Corporation (Madison, Wis.). Seven clones containing the fragment were sent to the University of Michigan for DNA sequencing.

[0051] Sequence analysis demonstrated that the NZM mouse strain only expresses the B2 form CD72 (without exon 3)(SEQ. ID. NO:21). Other point mutations are most likely the result of PCR errors. This results in the protein sequence of SEQ. ID. NO: 22.

[0052] Protein lysates from mouse spleen were analyzed by Western blotting. (FIGS. 9 and 10). Mice express at least 3 types of CD72 protein products as detected by antibody against the cytoplasmic domain of the protein. The majority of the protein detected by Western Blotting are bands of 26 kd and 28 kd (FIG. 9A). The density of the 26 kd and 28 kd proteins correlates well with the amount of CD72 expressed on the cell surface (the long forms). The 28 kd protein is a truncated form of the CD72 protein with a full length cytoplasmic domain while the 26 kd protein is a truncated CD72 with a 24 amino acid deletion in the cytoplasmic domain. NZM mice express a higher amount of CD72; normal strain C57BL/6J (CD72 allele) mice express the normal lower amount of CD72. The majority of full length CD72 expressed in C57BL/6J is 50 kd protein, 42 kd CD 72 protein being the minor. In contrast, NZM mice express mainly the 42 kd protein, the 50 kd form being the minor (FIG. 10). The results are consistent with the PCR analysis.

[0053] Twenty-three samples of genomic DNA from 23 normal individuals (without lupus symptoms) were analyzed by PCR analysis. The ITIM primers (SEQ. ID. NO.: 19 and SEQ. ID. NO.: 20) were used to amplify the 5′ un-transcribed region and Exons 1 and 2. No mutations in CD72 were observed. However, a patient exhibiting lupus symptoms in brain, kidney and joints, demonstrated a mutation in this region of CD72. The mutation is in the 5′ un-transcribed region. In normal patients, codons 993-994 are G-G (SEQ. ID. NO. :2). In the mutated CD72, codon 993 was either A/C/G and codon 994 was either T/G, depending on the allele (FIG. 11). The 23 normal samples represent 46 alleles in which there was no mutation.

[0054] Using methods such as those described herein, or other appropriate methods, it is now possible to diagnose diseases associated with the expression of a mutant form of CD72 by detecting the mutation or mutations in the CD72 gene that are associated with said disease.

[0055] The genomic DNA used for the diagnosis may be obtained from body cells, such as those present in peripheral blood, urine, saliva, bucca, surgical specimen, and autopsy specimens. The DNA may be used directly or may be amplified enzymatically in vitro through use of PCR (Saiki et al. Science 239:487-491 (1988)) or other in vitro amplification methods such as the ligase chain reaction (LCR) (Wu and Wallace Genomics 4:560-569 (1989)), strand displacement amplification (SDA) (Walker et al. PNAS USA 89:392-396 (1992)), self-sustained sequence replication (3SR) (Fahy et al. PCR Methods Appl. 1:25-33 (1992)), prior to mutation analysis. The methodology for preparing nucleic acids in a form that is suitable for mutation detection is well known in the art.

[0056] The detection of mutations in specific DNA sequences, such as the CD72 gene, can be accomplished by a variety of methods including, but not limited to, restriction-fragment-length-polymorphism detection based on allele-specific restriction-endonuclease cleavage (Kan and Dozy Lancet II:910-912 (1978)), hybridization with allele-specific oligonucleotide probes (Wallace et al. Nucl Acids Res 6:3543-3557 (1978)), including immobilized oligonucleotides (Saiki et al. PNAS USA 86:6230-6234 (1989)) or oligonucleotide arrays (Maskos and Southern Nucl Acids Res 21:2269-2270 (1993)), allele-specific PCR (Newton et al. Nucl Acids Res 17:2503-25 16 (1989)), mismatch-repair detection (MRD) (Faham and Cox Genome Res 5:474-482 (1995)), binding of MutS protein (Wagner et al. Nucl Acids Res 23:3944-3948 (1995)), denaturing-gradient gel electrophoresis (DGGE) (Fisher and Lerman et al. PNAS USA 80:1579-1583 (1983)), single-strand-conformation-polymorphism detection (Orita et al. Genomics 5:874-879 (1983)), RNAase cleavage at mismatched base-pairs (Myers et al. Science 230:1242 (1985)), chemical (Cotton et al. PNAS USA 85:4397-4401 (1988)) or enzymatic (Youil et al. PNAS USA 92:87-91 (1995)) cleavage of heteroduplex DNA, methods based on allele specific primer extension (Syvanen et al. Genomics 8:684-692 (1990)), genetic bit analysis (GBA) (Nikiforov et al. Nuci Acids Res 22:4167-4175 (1994)), the oligonucleotide-ligation assay (OLA) (Landegren et al. Science 241:1077 (1988)), the allele-specific ligation chain reaction (LCR) (Barrany PNAS USA 88:189-193 (1991)), gap-LCR (Abravaya et al. Nucl Acids Res 23:675-682 (1995)), and radioactive and/or fluorescent DNA sequencing using standard procedures well known in the art.

[0057] As will be appreciated, the mutation analysis may also be performed on samples of RNA by reverse transcription into cDNA therefrom. Furthermore, mutations may also be detected at the protein level using antibodies specific for the mutant and normal CD72 protein, respectively. It may also be possible to base an CD72 mutation assay on altered cellular or subcellular localization of the mutant form of the CD72 protein.

[0058] In a first method of diagnosing diseases associated with a CD72 mutation, hybridization methods such as Southern analysis, are used (see Current Protocols In Molecular Biology, Ausubel, F. et al., Eds., John Wiley & Sons, including all supplements). For example, a test sample of genomic DNA, RNA or cDNA, is obtained from an individual suspected of having (or carrying a defect for) a CD72 associated disease (the “test individual”). The individual can be an adult, child, or fetus. The test sample can be from any source which contains genomic DNA, such as a blood or tissue sample, such from skin or other organs. In a preferred embodiment, the test sample of DNA is obtained from a fibroblast skin sample, from hair roots, or from cells obtained from the oral cavity, e.g., via mouthwash. In another preferred embodiment, the test sample of DNA is obtained from fetal cells or tissue by appropriate methods, such as amniocentesis or chorionic villus sampling. The DNA, RNA, or cDNA sample is examined to determine whether one of the mutations described above is present; the presence of the mutation is indicated by hybridization of the CD72 gene in the genomic DNA, RNA, or cDNA to a nucleic acid probe. A “nucleic acid probe,” as used herein, can be a DNA probe or an RNA probe. The nucleic acid probe hybridizes to at least one of the mutations described above. A fragment of such a nucleic acid probe can also be used, provided the fragment hybridizes to the part of the CD72 gene that contains the mutation.

[0059] To diagnose CD72 associated diseases by hybridization, a hybridization sample is formed by contacting the test sample containing the CD72 gene with a nucleic acid probe. The hybridization sample is maintained under conditions which are sufficient to allow hybridization of the nucleic acid probe to the CD72 gene. Hybridization can be preformed under high stringency conditions or moderate stringency conditions, for example. “Stringency conditions” for hybridization is a term of art which refers to the conditions of temperature and buffer concentration which permit hybridization of a particular nucleic acid to another nucleic acid in which the first nucleic acid may be perfectly complimentary to the second, or the first and second nucleic acids may share only some degree of complementarity. For example, certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those of less complementarity. “High stringency conditions” and “moderate stringency conditions” for nucleic acid hybridizations are explained in Current Protocols In Molecular Biology, supra, the teachings of which are hereby incorporated by reference. The exact conditions which determine the stringency of hybridization depend on factors such length of nucleic acids, base composition, percent and distribution of mismatch between the hybridizing sequences, temperature, ionic strength, concentration of the stabilizing agents, and other factors. Thus, high or moderate stringency conditions can be determined empirically. In one embodiment, the hybridization conditions for hybridization are moderate stringency. In a particularly preferred embodiment, the hybridization conditions for hybridization are high stringency.

[0060] Hybridization, if present, is then detected using standard methods. A hybridization occurs between the nucleic acid probe and the CD72 gene in the test sample, and the CD72 gene has a mutation corresponding to the nucleic acid probe utilized. More then one nucleic acid probe can also be concurrently used in this method. Hybridization of any one of nucleic acid probes is indicative of a mutation that is associated with the mutant CD72 associated diseases, and is therefore diagnostic for one of these diseases. For example, in the diagnosis of thyroid follicular carcinoma, a nucleic acid probe can be prepared that hybridizes to a part of the mutation of the CD72 gene giving rise to the deletion in the first ITIM structure. If this nucleic acid probe hybridizes with the CD72 gene in the test sample, a diagnosis of thyroid follicular carcinoma is made. Alternatively, a nucleic acid probe can be prepared that hybridizes to a CD72 gene having one of the other mutations described above. Hybridization of such a nucleic acid probe with the CD72 gene in the test sample is indicative of the disease associated with that particular mutation.

[0061] In another hybridization method, Northern analysis (see Current Protocols In Molecular Biology, supra) is used to identify the presence of a mutation associated with a mutant CD72 associated disease. For Northern analysis, a sample of RNA is obtained from the test individual by appropriate means. Hybridization of a nucleic acid probe, as described above, to RNA from the individual is indicative of a mutation that is associated with the mutant CD72 associated disease, and is therefore diagnostic for that disease.

[0062] The newly developed technique of nucleic acid analysis via microchip technology is also applicable to the present invention. In this technique, literally thousands of distinct oligonucleotide probes are built up in an array on a silicon chip. Nucleic acid to be analyzed is fluorescently labeled and hybridized to the probes on the chip. It is also possible to study nucleic acid-protein interactions using these nucleic acid microchips. Using this technique one can determine the presence of mutations or even sequence the nucleic acid being analyzed or one can measure expression levels of a gene of interest. The method is one of parallel processing of many, even thousands, of probes at once and can tremendously increase the rate of analysis. Several papers have been published which use this technique. Some of these are Hacia et al., Nature Genetics 14, December 1996; Shoemaker et al., 1996; Chee et al., 1996; Lockhart et al., 1996; DeRisi et al., 1996; Lipshutz et al., 1995. This method has already been used to screen people for mutations in the breast cancer gene BRCA1 (see Hacia et al., supra). This new technology has been reviewed in a news article in Chemical and Engineering News (Borman, 1996) and been the subject of an editorial (Nature Genetics, 1996).

[0063] In another method of the invention, mutation analysis by restriction digestion can be used to detect a mutation, if the mutation in the gene results in the creation or elimination of a restriction site. A test sample containing genomic DNA is obtained from the test individual. Polymerase chain reaction (PCR) or ligase chain reaction (LCR) can be used to amplify the CD72 sequence (and, if necessary, the flanking sequences) in a test sample of DNA from the test individual. RFLP analysis is conducted as described: see Current Protocols In Molecular Biology, supra. The digestion pattern of the relevant DNA fragment indicates the presence or absence of the mutation associated with the mutant CD72 associated disease.

[0064] In another method of the invention, PCR is utilized to detect the mutant gene or a lack thereof. Primers having a sequence complimentary to the sequence on either side of the mutated sequence are used to amplify the DNA or RNA (if RNA is being detected, a reverse transcriptase stage must be performed, as would be apparent to one skilled in the art) containing the mutation. Wherein the mutation results in a measurable size difference in the amplified product, the presence of the mutation can be detected by gel electrophoresis. The advantages of using the PCR reaction is that the actual mutated sequence is obtained, less starting material is required and the PCR methods allow quantitative as well as qualitative determinations to be made. Quantitative determinations allow the number of copies of a mutated gene present in a particular sample to be estimated, and given this information the severity of the disease state can be estimated.

[0065] Another alternative method for detecting the presence of the mutant gene is one in which one primer has a complimentary sequence encompassing the mutation. Amplification will therefore only occur if the mutated sequence is present. Newton et al., Nucl. Acids. Res. 17:2503 (1989). The method has previously been used in detecting mutations in the gene responsible for cystic fibrosis, and one skilled in the art could easily perform this test for the detection of the mutant gene of the present invention.

[0066] Sequence analysis can also be used to detect specific mutations in the CD72 gene. A test sample of DNA is obtained from the test individual. PCR or LCR can be used to amplify the gene, and/or its flanking sequences. The sequence of the mutant CD72 gene, or a fragment of the gene, is determined, using standard methods. The sequence of the gene (or gene fragment) is compared with the known nucleic acid sequence of the gene. The presence of any of the mutations associated with the mutant CD72 associated disease indicates that the individual is infected with, or is a carrier for that particular disease.

[0067] Analysis of the protein product of the mutant gene can also be used to detect specific mutations. With the biological molecule to be analyzed as a protein, it may be desirable to release the nucleic acid from the biological sample cells prior to protein elusion, or to remove nucleic acid from the sample eluate prior to protein analysis, thus, the sample or eluate may first be treated to release or remove the nucleic acid by mechanical disruption (such as freeze/thaw, abrasion, sonication), physical/chemical disruption such as treatment with detergents, osmotic shock, heat, enzymatic digection or nucleus treatment, all according to well known methods in the art.

[0068] Where a biological sample includes a mutant protein, the presence or absence of which is indicative of a genetic disease, the protein may be detected using conventional detection methods, for example, using protein-specific probes such as an antibody probe. Additionally, absence of the native normal protein is also indicative of a mutant gene. As such, where a genetic disease correlates with the presence or absence of an amino acid or sequence of amino acids, these amino acids may be detected using conventional means, e.g., an antibody which is specific for the native or mutant sequence.

[0069] Any of the antibody reagents useful in the method of the present invention may comprise whole antibodies, antibody fragments, polyfunctional antibody aggregates, or in general any substance comprising one or more specific binding sites from an antibody. The anti fragments may be fragments such as Fv, Fab and F(ab′).sub.2 fragments or any derivatives thereof, such as a single chain Fv fragment. The antibodies or antibody fragments may be non-recombinant, recombinant or humanized. The antibody may be of any immunoglobulin isotope, e.g., IgG, IgM and so forth. In addition, aggregates, polymers, derivatives and conjugates of immunoglobulins or their fragments can be used where appropriate. The immunoglobulin source for an antibody reagent can be obtained in any manner such as by preparation of a conventional polyclonal antiserum or by preparation of a monoclonal or a chimeric antibody. Antiserum can be obtained by well-established techniques involving immunization of an animal, such as a mouse, rabbit, guinea pig or goat, with an appropriate immunogen.

[0070] For the production of polyclonal antibodies, the peptide or polypeptide may be conjugated to a conventional carrier in order to increase its immunogenicity, and antisera to the peptide-carrier conjugate is raised in rabbits. Coupling of a peptide to a carrier protein and immunizations are performed as described (Dymecki, S. M. et al., J. Biol. Chem., 267:4815-4823 (1992)). Rabbit antibodies against this peptide are raised and the sera titered against peptide antigen by ELISA or alternatively by dot or spot blotting (Boersma and Van Leeuwen, J. Neuroscience Methods, 51:317 (1994)). At the same time, the antiserum may be used in tissue sections. The sera is shown to react strongly with the appropriate peptides by ELISA, following the procedures of Green et al., Cell, 28, 477-487 (1982). Preferably, the sera exhibiting the highest titer is subsequently used.

[0071] Techniques for preparing monoclonal antibodies are well known, and monoclonal antibodies of this invention may be prepared using a synthetic peptide, preferably bound to a carrier, as described by Arnheiter et al., Nature, 294, 278-280 (1981).

[0072] Monoclonal antibodies are typically obtained by hybridoma tissue cultures or from ascites fluid obtained from animals into which the hybridoma tissue was introduced. Nevertheless, monoclonal antibodies may be described as being “raised to” or “induced by” the synthetic peptides or their conjugates.

[0073] Particularly preferred immunological tests rely on the use of either monoclonal or polyclonal antibodies and include enzyme linked immunoassays (ELISA), immunoblotting, immunoprecipitation and radio immunoassays (RIA). (See Voller, A., Diagnostic Horizons, 2:1-7, 1978, Microbiological Associates Quarterly Publication, Walkersville, Md.; Voller, A. et al., J. Clin. Pathol., 31:507-520 (1978); U.S. Reissue Pat. No. 31,006; Butler, J. E., Meth. of Enzymol., 73:482-523 (1981); Maggio, E., Enzyme Immunoassay, CRC Press, Boca Raton, Fla. (1980); Weintraub, B., Principals of Radioimmunoassays, 7^(th) Training Course on Radio Ligandassay Techniques, The Endocrine Society, March 1986, pp. 1-5, 46-49 and 68-78). For analyzing tissues for the presence of the mutant protein of the present invention, immunohistochemistry techniques are preferably used. It will be apparent to one skilled in the art that the antibody molecule will have to be labeled to facilitate easy detection of mutant protein. Techniques for labeling antibody molecules are well known to those skilled in the art. (See Harlour and Lane, Antibodies, Cold Spring Harbor Laboratory, pp. 1-726 (1989).)

[0074] Alternatively, sandwich hybridization techniques may be used, for example, an antibody specific for a given protein. In addition, an antibody specific for a haptenic group conjugated to the binding protein can be used. Another sandwich detection system useful for detection is the avidin or streptavidin system, where a protein specific for the detectable protein has been modified by addition of biotin. In yet another embodiment, the antibody may be replaced with a non-immunoglobulin protein which has the property of binding to an immunoglobulin protein, for example, Staphylococcal protein A or Streptolococcal protein G, which are well known in the art. The protein may either itself be labeled or may be detected indirectly by a detectable labeled secondary binding protein, for example, a second antibody specific for the first antibody. Thus, if a rabbit-anti-hybrid wild-type/nonsense protein antibody serves as the first binding protein, a labeled goat-anti-rabbit immunoglobulin antibody would be a second binding protein.

[0075] In another embodiment, the signal generated by the presence of a hybrid wild-type/nonsense protein is amplified by reaction with a specific antibody for that fusion protein (e.g., an anti-beta galactosidase antibody) which is detectably labeled. One of ordinary skill in the art can devise without undue experimentation a number of such possible first and second binding protein systems using conventional methods well known in the art. Alternatively, other techniques can be used to detect the mutant proteins, including chromatographic methods such as SDS PAGE, isoelectric focusing, Western blotting, HPLC and capillary electrophoresis.

[0076] According to the present invention, a method is also provided of supplying wild-type CD72 function to a cell which carries mutant CD72 alleles. Supplying such function should suppress the hyperproliferation or hyperactivation of the recipient cells. The wild-type CD72 gene may be introduced into the cell in a vector such that the gene remains extrachromosomal. In such a situation, the gene will be expressed by the cell from the extrachromosomal location. If a gene portion is introduced and expressed in a cell carrying a mutant CD72 allele, the gene portion should encode a part of the CD72 protein which is required for non-neoplastic growth of the cell. More preferred is the situation where the wild-type CD72 gene or a part of it is introduced into the mutant cell in such a way that is recombines with the endogenous mutant CD72 gene present in the cell. Such recombination requires a double recombination event which results in the correction of the CD72 gene mutation. Vectors for introduction of genes both for recombination and for extra chromosomal maintenance are known in the art and any suitable vector may be used. Methods for introducing DNA into cells such as electroporation, calcium phosphate co-precipitation, and viral transduction are known in the art and the choice of method is within the competence of the practitioner. Cells transformed with the wild-type CD72 gene can be used as model systems to study cancer remission and drug treatments which promote such remission.

[0077] Polypeptides which have CD72 activity can be supplied to cells which carry mutant or missing CD72 alleles. The sequence of the CD72 protein is disclosed in SEQ. ID. NO:3. Protein can be produced by expression of the cDNA sequence, or a fragment of the sequence in bacteria, for example, using known expression vectors. Alternatively, CD72 can be extracted from CD72-producing mammalian cells such as pre-B cells. In addition, the techniques of synthetic chemistry can be employed to synthesize CD72 protein. Any of such techniques can provide the preparation of the present invention which comprises the CD72 gene product having the sequence shown in SEQ. ID. NO:3. The preparation is substantially free of other human proteins. This is most readily accomplished by synthesis in a microorganism or in vitro. Active CD72 molecules can be introduced into cells by microinjection or by use of liposomes, for example.

[0078] Alternatively, some such active molecules may be internalized by cells, actively or by diffusion. (See review by Ford et al. Gene Therapy 2001, Jan. 8(1): 1-4.) This process relies on the inherent property of a small number of proteins and peptides of being able to penetrate the cell membrane. The transducing property of these molecules can be conferred upon proteins which are expressed as fusions with them and thus offers an alternative to gene therapy for the delivery of therapeutic proteins into target cells.

[0079] Supply of molecules with CD72 activity should lead to a partial reversal of the hyper proliferation or hyper active state. Other molecules with CD72 activity may also be used to affect such a reversal, for example, peptides, drugs or organic compounds.

EXAMPLE 1

[0080] Antibody against CD72 was used to detect CD72 protein expression in tumor tissues. The loss of CD72 protein expression suggests mutations in the CD72 gene. The over-expression of CD72 also suggests mutations in the CD72 gene as the increased CD72 expression does not result in an inhibition of cell proliferation. Tissue samples from tumor tissue and the surrounding normal tissue may be procured through surgery or needle biopsy. Tissue homogenates are prepared in lysis buffer containing NP-40. The protein concentration in the homogenate can be quantified by BCA kits from Pierce Co.(Rockford, Ill.). Equal amounts of the protein from homogenates can be loaded into SDS-PAGE gels as described in FIG. 8. Alternatively, known amounts of protein from homogenates can be loaded into SDS-PAGE as described in FIG. 3. Western blotting of these electrophoresis gels is done as described in FIG. 4. The over-expression of CD72 (See FIG. 3) indicates a mutation in the CD72 from the endometrial carcinoma sample as the over-expression of CD72 does not inhibit cell growth. The loss of CD72 protein expression in lymphomas and the endometrial carcinoma of FIG. 8 also suggests a mutation in CD72 gene. Thus, we have used antibodies directed to CD72 to diagnose diseases with abnormal expression of CD72, including the cancers and lupus discussed above.

EXAMPLE 2

[0081] A plasmid expressing the normal CD72 cDNA with a CMV promoter was introduced into an endometrial adenocarcinoma cell line ECC-1, which contains a mutation in the CD72 gene, as described above. A control vector (pCDNA3.1+ from Invitrogen, Carlsbad, Calif.) without the CD72 cDNA was used as a control. The ECC-1 cells were grown to 80% confluency. The cells were then trypsinized and washed once in complete RPMI medium. EPBS (in 250 ml of water, 2.197 g NaCl, 96.6 mg NaH2PO4.H2O, 482.4 mg Na2HPO4.7H2O) was used to suspend the cells to a concentration of approximately 5 million per ml. The cells were mixed up with DNA in the 2 mm-gap electroporation cuvette and incubated at 4 C for 15 min. Electroporation was carried out using BTX 600 electroporator (Set voltage 185V, Capacitance: 960 mF). The cuvette was put back into ice again for 15 min and then the cells were transferred to flask containing the complete RPMI medium. The medium was changed every three days. After 9 days, the cells were trypsinized and equal number of cells from both the control and test samples were put into a culture chamber. The cells were counted using XTT kits purchased from Sigma Co. (St. Louis, Mo.) The growth of the ECC-1 cell line transfected with the CD72 expressing vector was significantly slowed when compared to the vector control. This suggests that introduction of normal CD72 into cells expressing a mutant CD72 inhibits cell growth in the cancer cell line. This suggests that CD72 is a tumor suppressor gene and that wild-type CD72 can be supplied to cells that have a mutant gene resulting in a defective CD72 protein.

[0082] While embodiments of the invention have been illustrated and described, it is not intended that these embodiments illustrate and describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention.

1 26 1 9653 DNA Homo sapiens 1 ctattggcag agctctaggg tccctggcag tgtcctgcca cacaccacca atcatccgag 60 ctgttggcat gtggtgagga ccttttcccc cttccttttt cctccccaca gctgcaggag 120 ggggattctg cggggcggat gtcactttgc taggggcggg ttcttcccac ccccacacta 180 catcttgttc tagctccttc ctatcccgga tgtaggccat tttggggttc ctagtagggg 240 tcacatcccg gtaaaagaac attagggagc acctttgggc acccctgctc agcagccccc 300 tttcccccaa ctggttccct ctctcctcca tcagccaggc ccccaccacc tatgagggcc 360 tcgcggtccc tgggtcctcc gccacaaatg cccccttttt ccttggggct gggtgggaag 420 ggacagcttt gggacttttt aaagaaagaa ctttgctagg aaagtaagag acgaaccaaa 480 accaaaccaa gaaaatgtca agacctccgc tggtccggaa tctgcgctcg gcctctggct 540 ctgttctgct cctctgcctt gacttcacct aggaagctgc agccttctta agccctctgc 600 aactcccttg ctcacctgct ccctggctcc ccatgagggt gcatgggggg cagtgctgcc 660 acttgccttt ccagaggtgc tgcagccatg gctcttcctt accttcacct ctggcccagg 720 cctgttcctg aacccccagt tactggaacc cccaggatga tgagaagcga gggatgcaaa 780 ccaggaaccc ccagtctcat ggggagaacg gagcctcacc ctggagctgg ggagatcgtc 840 tgggtcagtg cagctccagc tacacatcat aagccccagt ggacaaggca gccaggagga 900 gggtggggca gatcagggca gataaccaga aggaccgagt ttggggccca gcatggccca 960 tatgggttta agatgcatgg agaggaagca cagggaggac atggggacag gcccatcccc 1020 tacccctgtg ttctggacct cagcgtcccc tccgtggccc ctctggccct ggcccaggga 1080 tgccttctgc ttcattgagc ccatctgttt gagacttgct aacaccttcc tcttttgtga 1140 ctggaagtca cagggccaat tgctaagccg tgcagtcaca gagggaacac agagcctagt 1200 tgtaaacgga cagagacgag aggggcaagg gaggacagtg gatgacaggg aagacgagtg 1260 ggggcagagc tgctcaggac catggctgag gccatcacct atgcagatct gaggtttgtg 1320 aaggctcccc tgaagaagag catctccagc cggttaggac agggtaaggg ggatgaggcc 1380 tccccttgat cctgcatccc cacttcccgc cccacagatc ccattcccaa atcccacctt 1440 ggtccctttt ctcttctctc tagacccagg ggctgatgat gatggggaaa tcacctacga 1500 gaatgttcaa gtgcccgcag tcctaggggt gccctcaagc ttggcttctt ctgtactagg 1560 ggacaaagca ggtctggaga gcctggggga tgtgtgtttg ttggggggct gtgtcttgag 1620 ctggggagtc cacaatgttt ttgttgttgt tcttgagaca gggtcttgtt ctgtcactca 1680 ggctggagtg cagtggtgtg atcatggctc actgcagcct cgacctcctg ggctcaagtg 1740 atcctcctgc cctggcctcc caaagtgttg ggattgcagg cgtgcgcccc catgccatgg 1800 ccctcagtat tcttgagagc acaataccct ggggaaagga ggagggcggt tcttaggaaa 1860 cagaagaata ggaacaggat aaatgggagc caggggagga agaaagattc ctcccgcaag 1920 gtggagaccc ctttcgggga gataagggct gcggagggat agggcagccc tgtgggggat 1980 gagttctaat gggaaacggg gagccatggg agggaaaaga ggtgatggga gatggatgag 2040 agcaaagggg gagagggttt ggagccgaca gataaggggc ctctgaggaa acgataaagg 2100 gagaactagg agagatgagg gggtcccggg aaaagataca gtgagaccca ggaggaagcc 2160 gagagagtcc cgggcagagg aggaactcag gagagtgaca ctgtgtccca gagcagctct 2220 agcaggcaac gtagggcaga ctggagtttc ccagtggcga gggcgaagcg ggtgcgcagg 2280 aggagaaggg gcaacactcg gggcttccgg tctcacactg gatgctcccc tcgacagcgg 2340 tcaagtcgga gcagccaact gcgtcctgga gagccgtgac gtcaccagct gtcgggcgga 2400 ttctcccctg tgagtgtgcc tgtgcggggc agccgcgggg ccaagtgctc gcgcctgtcc 2460 cggctgcgga tggcgctggg ccccgcgggg tgcagctcct tccgggataa acccgggcgc 2520 agtccttccg acgcccgggc tgccgtgcca cattcacctt catccggtgc gcctgaatcc 2580 gacgtgcacc gcccccctgc ggatggtccc aattcaccta cctgcccccc ttcccctctc 2640 tccgtctgat atttccttcc ccttctccca ggtaacgccc ccgagtcccc accagcactg 2700 attctggaac accctctccc ctacctgcga aacaccccgc tacccgacag aaaccaccat 2760 gcttgccccc tcagctctgc accgccctcc tcttagggat gccttcccca ggggtctccc 2820 ccacggacct gtcccttcct ctctacattc tttacggggg ctgactgccc tcatccatcc 2880 cactgtcccc ctaaggccgc acaacctgcc tgcgatacct cctgctcggc ctgctcctca 2940 cctgcctgct gttaggagtg accgccatct gcctgggagt gcgctgtgag taaggctgtc 3000 cctacttccc acttacaccg aacgactttc gtcccgactc tcgccccaaa gctgatcttc 3060 ctcagcaggg atcatagtca ctaccaccac ctccttagct ttgggtttgg tttccagcca 3120 gggctgttgt cttcctgaat tgtcctcacc taatccagta cctccttacg ctttagtcag 3180 tcaaatcttg tatttcagcc tgtgatgcaa aaagttgcaa gaagcagaaa aagggctgat 3240 gctgcagtct cagggcacta gggcagggct ggagaagaca tccaccaaat ctctgtggcc 3300 aaacagatct gcaggtgtct cagcagctcc agcagacgaa cagggttctg gaagtcacta 3360 acagcagcct gaggcagcag ctccgcctca agataacgca gctgggacag agtgcagagg 3420 atctgcaggg gtccaggaga gagctggcgc agagtcagga agcactacag gtggaacaga 3480 gggctcatca ggcggccgaa gggcagctac aggcctgcca ggcagacaga cagaagacga 3540 aggagacctt gcaaagtgag gagcaacaga ggagggcctt ggagcagaag ctgagcaaca 3600 tggagaacag actgaagccc ttcttcacat gcggctcagc aggtatccgc tctggggaga 3660 ggagagaagg gagggatgga ttggagcaag gagatagtag tggcaggagt ctatcaccca 3720 ggggcctgtg aggtgttcag gaaaggggca gtgaaccaga gtggtggctg acatgagctg 3780 ttcagccttg gccaacagag agaaggacca agctgcatac tagatttagg gggacccacc 3840 cagctccaag gctcatcctg taggggagag gaggatggca gccagcagtc caaatcacac 3900 tgtccaaggt ttcttccggt tgatatcagt gacaaggatc tgggaacgcc cccaaatttg 3960 gatgtgaact tggcggtttg cacatctttc taaagacagg gacctcattt ttaattaatt 4020 ttcgttatag ctgagatctc cccaaagggt aagacttact ggctgacatg gagtgcaacc 4080 tggtggtggg gcaaggcagt gaggcgctca gcactcacag tgaggtggag ctgggagttg 4140 tggtgtaggg aactaggatg gggtgtccat tcattttgaa agcaggaccc cagccagtaa 4200 gtggagatgg ggaaagggac acataaaaga ggcaaggctg tatcaaaaca ggctgagggt 4260 ctaggcaagg tgccacgcct aaggggactg gggtgttttt gggctatgat caggtttagg 4320 ctctgtctag cttgggggtc atctgggtga tctccccagg cataagacag caagattatg 4380 tgcagacacc ccttgccctc atttccttaa ccttaccctc tgtgaccatc agttccatgc 4440 cagggctgct cttagcagca tgttagacct ctacctccca aattgctctt ccggcgaaag 4500 gatggtttca gacttccact ctgatcacag ctgcccacca ctcctgcagg ttttccccac 4560 acttttttga tctcatcaag gcctccaatc tgctgactcc cctttttctc ccttcccatc 4620 agtccaggcc tccaatcaac tgattcccct ttttctctct tcccatcagc ttctcctgtc 4680 cacacttctc tccctaccta acatgggttt ggagcccatc acttcaacta ctcacttatt 4740 tttttaaata gagacaaggt ctcactatgt ttccctggcc ggtatcgaac tcctggactc 4800 aagcgatcct cctgcctcag cctcccaaag tgctgagatt acagtcgtga gccaccgcac 4860 ctggctcact cacttctttg ttccattatc cttctagtgt acctgatggg caaaatttga 4920 actctggatg aacccaactg cccatggtca gcatcaaggt gtgacaaatt gtggaatcat 4980 tctgctcagt gccactgaga ctctagtctc tcagcactgc ctggcaacct ttgcaatggt 5040 cagcttgctc tccatagtaa ctccttctcc ctcactttca gcagatgtcc tcatcttcca 5100 ctctttcaga gaggaattcc cttcactttc catcaccaag tctatatatc catatggatt 5160 tttgcccatc ctcatctttt gcaaggatat ctgcaacagc tttctaaatg tttttcctac 5220 tcactcctct ccctgcagtc cattcttcat attggagcca gggtaatctt ttacaaacaa 5280 aaatctgatc gtgtcaccca atctacagca ttccagttaa ttgattacat aaacagttat 5340 ggaatacata acatataata tgttctagca ctgtgcccag gcacttggga tatagctagt 5400 gaacaaaatt cctctatgta aaaccctaca ctgactttct cttcctcata atataaaaat 5460 aaaaatcctt aacatggctt ccatcactcc tgcctacctt tcagtgtgcc acttatctcc 5520 tcaccctctt ccatttggcc atacagattt cttttttttt ttttttgagg cggagtctcg 5580 ctctgtgaga ctgaagtgca ctcccaggct ggagtgcagt ggtgtgatct gggctcactg 5640 caacctctgc ctcctggatt caagctatcc ttttgcctca gcctcccgag taactgggat 5700 tataggcatg cgccatggtg cctgggtatt ttttggtatt tttagtagag gtggggtttc 5760 accatgttga ccaggctggt cttaaactcc tgacttcagg tgatccaccc cgcctcggcc 5820 tcccaaagtg ctgggattac agaagtgagc cattgtgccc agcagggatt tctttttatt 5880 cctttaagtt gccataggac agtagctcaa gggagtagca gggtggggac atgagcatat 5940 gtagagagaa gagagggagc tagtggagag ggaatgattt ttgagtgatg aaggtaggac 6000 tgatggaaca aagcaatgga gagaggtggc ctgggatcag aagtgcacat tgggatgtag 6060 gatgttgaag aaagagggat atttatttct tatggatagt caggaaaaga agaacagaga 6120 agagaaattt tgaggtggag aggagggaaa aggagagagc tcagtttctg caaaacagta 6180 cacaacattt tctgcttgta ggaaattgga gagggacgga gggggccagt gctggaggag 6240 tggaggatgt gcagtgaggg gtgctttagg gagtggagga gatgggccca agagatgtta 6300 ggtaaagtgt cagggtggac tgcaattgtt cagaaattgg ggtaataaaa tcagccctat 6360 taagacagac ctcccagatt gctaagaagt aatgttgttg aatgttaaag tctttttact 6420 aagtaaggac aggtatggtg tttccagggg atgcaaaaga aagattgcaa agatcataga 6480 agtccatgaa ggtgctgaga ttagctctgg gggaatattg ttagcaaaag attgctgagt 6540 agtcactttc atcccaactg ttgctgctat cacactcttt ctgcttttta gacacctgct 6600 gtccgtcggg atggataatg catcagaaaa gctgctttta catctcactt acttcaaaaa 6660 attggcagga gagccaaaaa caatgtgaaa ctctgtcttc caagctggcc acattcagtg 6720 aaatttatcc acaatcagta agcatatcct tactcacaga ctgcagtggg tactattagg 6780 taaagggaca tatgtcatca agtcaggaat ggcagtgcac acatggctgg ggtggccatt 6840 ccttttccat ggaagaaact gagaatcata atactcttga cctctgagct tggaaacaac 6900 ctctgcatct tctcagcaca gtggtccagg cagcccctag taactgatta gagttggcat 6960 gagaatggaa acttaaccat atctacattt tttttttttt gagacagagt ctcactctgt 7020 cacccaggct ggagtgcact tgcataatct cagctcactg caacctccgc ctcctgggtt 7080 catgcaattc tcccgtctca gcctcctgag tagctgggac tacagtcaca tgccaccacg 7140 cccagctaat ttttgtattt ttaatagagt cggggtttca ccatattggt caggctggtc 7200 tctaactcct ggcctcaggt gatccacctg cttcagtctc ccaaagtgct gggattatag 7260 gtgtgaacca ctgcgcccgg cctctacttt aatgtgtatc tcatctcagc ttgctggtca 7320 gctccctggg ctgcttgaca ggttcctgga ctcttgtggc ctctcaggca gagaggtccc 7380 acacctggtg ggctggattg tagggattag ttccttgggg ccagaacagc acacagaatg 7440 actcccctcc aaaggctggc agagattctt actcagctgg aaggcaaatt tagcagtgag 7500 cctgcatctg cagggtggca cttgatggat tcagggacaa gttaccaaag agctgtcata 7560 tatacagccc ccttgccaaa gctgaggcct ccctagggtg tgcattttct tccatatttc 7620 tcatcactct ccatgtatct ctggtctctc tgaatctccc ctcagcactc ttactacttc 7680 ttaaattcac tgttgccaaa tggtggttca gggaattcat attggactgg cctcagctct 7740 aacaaggatt ggaagttgac tgatgataca caacgcacta ggtaagtttg ttaggacttc 7800 tgggtatttt caactccata atcccctcca taaagacatg gtaatcagat ttccttccca 7860 cctggtcccc tcaggactct cccataaatc aacactgtaa ttgcctattt gtctgtttcc 7920 tctatttgat taataacttc ttagggcaat cttattctcc attctatttc cacctagaag 7980 tccctggcac atactttttt gtttcagtaa ctactaggtt gggacctacg ctatctttta 8040 gggggagaca ttaaggattt agttaaggct ccacagctga tgacagcagc gtttctccat 8100 ccttcgagaa ggccacccaa actgaggttg tactttctca gtagcggcca cttggggcac 8160 tagagctgcc tcagtgaagt acgcaaggga gcagcggggg taccatcgaa gctccacttc 8220 actgtgatca agacagcagt ggggtaccat tgaggatcaa ccctagggac cctgaaacag 8280 ggtggcaacc tccttatgtt cctcaattgg ccttcccatc tttttttttt ttttgagatg 8340 gagtctcgct ctgtcgccca gactggagtg cagtggtgca atctcagctc actgcaagct 8400 ccgcctcagg ggttcacacc attctcctgc ctcagcctcc ccagcagctg ggactacagg 8460 cgcacaccgc catgcccggc taattttttg tatttttagt agagacgggg tttcaccgtg 8520 ttagtcagga tggtctcgat gtcctgacct tgtgatccac ccgccttggc ctcccaaagt 8580 gctgggattg caggcatgag ccaccgtgcc cagcctggcc ttcccatctt taagatctcg 8640 agttattttt ccctctgctg gttctaaagt ctgttttagt tgaactccag acatctgtcc 8700 tgaaccatgc ccagatgtgg ccttaggcag gcagggtgag ttaggattta ggttttatac 8760 aggggaaggg gagagaaggg agggtgggat atgtccagag catatcccag ctctacctta 8820 ctacttttta ggacttatgc tcaaagctca aaatgtaaca aggtacataa aacttggtca 8880 tggtggacac tggagtcaga gtcatgtaga agttctcttc cctacatctg tgagatgaca 8940 gctttcaggt ttccagatta ggacagtcct ttgcactgag ttggtaagag tgagggatgg 9000 ggctgaggca tggggagtcc aggggcagag ggactcagag cagggactca gagcaggggc 9060 ccaggggccc gagagaaaac ttgtacctgg gatgaaagtt cctccacagc ctccagttgc 9120 ttggtctgct tgttagggtg gggtgggcag ggagggacag gtaaggccta caacttatgg 9180 gaggctggtg gtgtggctgt cgagagctga ggaggagatg gggtgagtgg gatgagggtt 9240 cagttaagtc aaccatggag ctcactgata ctttactttt ccagacactc atgccaacaa 9300 gaacctgtgc ccctccttcc taacctgagg cctggggttc ctcagaccat ctccttcatt 9360 ctgggcagtg cccagccacc ggctgaccca cacctgacac ttccagccag tctgctgcct 9420 gctccctctt cctgaaactg gactgttcct gggaaaaggg tgaagccacc tctagaaggg 9480 actttggcct ccccccaaga acttcccatg gtagaatggg gtgggggagg agggcgcacg 9540 ggctgagcgg ataggggcgg cccggagcca gccaggcagt tttattgaaa tctttttaaa 9600 taattgcacg tgttagtctc atgtgtcagc aatgctgtgt ctggttcagt gat 9653 2 1532 DNA Homo sapiens 2 agtcacagag ggaacacaga gcctagttgt aaacggacag agacgagagg ggcaagggag 60 gacagtggat gacagggaag acgagtgggg gcagagctgc tcaggaccat ggctgaggcc 120 atcacctatg cagatctgag gtttgtgaag gctcccctga agaagagcat ctccagccgg 180 ttaggacagg acccaggggc tgatgatgat ggggaaatca cctacgagaa tgttcaagtg 240 cccgcagtcc taggggtgcc ctcaagcttg gcttcttctg tactagggga caaagcagcg 300 gtcaagtcgg agcagccaac tgcgtcctgg agagccgtga cgtcaccagc tgtcgggcgg 360 attctcccct gccgcacaac ctgcctgcga tacctcctgc tcggcctgct cctcacctgc 420 ctgctgttag gagtgaccgc catctgcctg ggagtgcgct atctgcaggt gtctcagcag 480 ctccagcaga cgaacagggt tctggaagtc actaacagca gcctgaggca gcagctccgc 540 ctcaagataa cgcagctggg acagagtgca gaggatctgc aggggtccag gagagagctg 600 gcgcagagtc aggaagcact acaggtggaa cagagggctc atcaggcggc cgaagggcag 660 ctacaggcct gccaggcaga cagacagaag acgaaggaga ccttgcaaag tgaggagcaa 720 cagaggaggg ccttggagca gaagctgagc aacatggaga acagactgaa gcccttcttc 780 acatgcggct cagcagacac ctgctgtccg tcgggatgga taatgcatca gaaaagctgc 840 ttttacatct cacttacttc aaaaaattgg caggagagcc aaaaacaatg tgaaactctg 900 tcttccaagc tggccacatt cagtgaaatt tatccacaat cacactctta ctacttctta 960 aattcactgt tgccaaatgg tggttcaggg aattcatatt ggactggcct cagctctaac 1020 aaggattgga agttgactga tgatacacaa cgcactagga cttatgctca aagctcaaaa 1080 tgtaacaagg tacataaaac ttggtcatgg tggacactgg agtcagagtc atgtagaagt 1140 tctcttccct acatctgtga gatgacagct ttcaggtttc cagattagga cagtcctttg 1200 cactgagttg acactcatgc caacaagaac ctgtgcccct ccttcctaac ctgaggcctg 1260 gggttcctca gaccatctcc ttcattctgg gcagtgccca gccaccggct gacccacacc 1320 tgacacttcc agccagtctg ctgcctgctc cctcttcctg aaactggact gttcctggga 1380 aaagggtgaa gccacctcta gaagggactt tggcctcccc ccaagaactt cccatggtag 1440 aatggggtgg gggaggaggg cgcacgggct gagcggatag gggcggcccg gagccagcca 1500 ggcagtttta ttgaaatctt tttaaataat tg 1532 3 359 PRT Homo sapiens 3 Met Ala Glu Ala Ile Thr Tyr Ala Asp Leu Arg Phe Val Lys Ala Pro 1 5 10 15 Leu Lys Lys Ser Ile Ser Ser Arg Leu Gly Gln Asp Pro Gly Ala Asp 20 25 30 Asp Asp Gly Glu Ile Thr Tyr Glu Asn Val Gln Val Pro Ala Val Leu 35 40 45 Gly Val Pro Ser Ser Leu Ala Ser Ser Val Leu Gly Asp Lys Ala Ala 50 55 60 Val Lys Ser Glu Gln Pro Thr Ala Ser Trp Arg Ala Val Thr Ser Pro 65 70 75 80 Ala Val Gly Arg Ile Leu Pro Cys Arg Thr Thr Cys Leu Arg Tyr Leu 85 90 95 Leu Leu Gly Leu Leu Leu Thr Cys Leu Leu Leu Gly Val Thr Ala Ile 100 105 110 Cys Leu Gly Val Arg Tyr Leu Gln Val Ser Gln Gln Leu Gln Gln Thr 115 120 125 Asn Arg Val Leu Glu Val Thr Asn Ser Ser Leu Arg Gln Gln Leu Arg 130 135 140 Leu Lys Ile Thr Gln Leu Gly Gln Ser Ala Glu Asp Leu Gln Gly Ser 145 150 155 160 Arg Arg Glu Leu Ala Gln Ser Gln Glu Ala Leu Gln Val Glu Gln Arg 165 170 175 Ala His Gln Ala Ala Glu Gly Gln Leu Gln Ala Cys Gln Ala Asp Arg 180 185 190 Gln Lys Thr Lys Glu Thr Leu Gln Ser Glu Glu Gln Gln Arg Arg Ala 195 200 205 Leu Glu Gln Lys Leu Ser Asn Met Glu Asn Arg Leu Lys Pro Phe Phe 210 215 220 Thr Cys Gly Ser Ala Asp Thr Cys Cys Pro Ser Gly Trp Ile Met His 225 230 235 240 Gln Lys Ser Cys Phe Tyr Ile Ser Leu Thr Ser Lys Asn Trp Gln Glu 245 250 255 Ser Gln Lys Gln Cys Glu Thr Leu Ser Ser Lys Leu Ala Thr Phe Ser 260 265 270 Glu Ile Tyr Pro Gln Ser His Ser Tyr Tyr Phe Leu Asn Ser Leu Leu 275 280 285 Pro Asn Gly Gly Ser Gly Asn Ser Tyr Trp Thr Gly Leu Ser Ser Asn 290 295 300 Lys Asp Trp Lys Leu Thr Asp Asp Thr Gln Arg Thr Arg Thr Tyr Ala 305 310 315 320 Gln Ser Ser Lys Cys Asn Lys Val His Lys Thr Trp Ser Trp Trp Thr 325 330 335 Leu Glu Ser Glu Ser Cys Arg Ser Ser Leu Pro Tyr Ile Cys Glu Met 340 345 350 Thr Ala Phe Arg Phe Pro Asp 355 4 9373 DNA Homo sapiens 4 ggccacccca gggtctggtc cctgacgacg cgcagtgagg gccccgccgc taccccagca 60 gtcgcctccc aagttcgcgg aacgcagctg accggctccc tctggactgg gtgacatgac 120 tgctcccaag cagtcgtttg taaactgagt ttctgtaaaa caattttatt tttcatatgt 180 gactgtagcg gggtatgatt tgaactttgt tttccgtccc ccagcccgga ttctctgtct 240 tctcctgtac agccgttccg ttttcttacc tcgtctccgt caccgaggcc ctcagccctg 300 aacacaagga ctgggcagtt tccctattga ttcctgaacc tggaacttaa gacatcttcc 360 gaggggcccc cccttgccac accctctagc tgatcgactc acaaatacct gtgatttctc 420 tccccgtctc cacctccagg cctgttcctg aacccccagt tactggaacc cccaggatga 480 tgagaagcga gggatgcaaa ccaggaaccc ccagtctcat ggggagaacg gagcctcacc 540 ctggagctgg ggagatcgtc tgggtcagtg cagctccagc tacacatcat aagccccagt 600 ggacaaggca gccaggagga gggtggggca gatcagggca gataaccaga aggaccgagt 660 ttggggccca gcatggccca tatgggttta agatgcatgg agaggaagca cagggaggac 720 atggggacag gcccatcccc tacccctgtg ttctggacct cagcgtcccc tccgtggccc 780 ctctggccct ggcccaggga tgccttctgc ttcattgagc ccatctgttt gagacttgct 840 aacaccttcc tcttttgtga ctggaagtca cagggccaat tgctaagccg tgcagtcaca 900 gagggaacac agagcctagt tgtaaacgga cagagacgag aggggcaagg gaggacagtg 960 gatgacaggg aagacgagtg ggggcagagc tgctcaggac catggctgag gccatcacct 1020 atgcagatct gaggtttgtg aaggctcccc tgaagaagag catctccagc cggttaggac 1080 agggtaaggg ggatgaggcc tccccttgat cctgcatccc cacttcccgc cccacagatc 1140 ccattcccaa atcccacctt ggtccctttt ctcttctctc tagacccagg ggctgatgat 1200 gatggggaaa tcacctacga gaatgttcaa gtgcccgcag tcctaggggt gccctcaagc 1260 ttggcttctt ctgtactagg ggacaaagca ggtctggaga gcctggggga tgtgtgtttg 1320 ttggggggct gtgtcttgag ctggggagtc cacaatgttt ttgttgttgt tcttgagaca 1380 gggtcttgtt ctgtcactca ggctggagtg cagtggtgtg atcatggctc actgcagcct 1440 cgacctcctg ggctcaagtg atcctcctgc cctggcctcc caaagtgttg ggattgcagg 1500 cgtgcgcccc catgccatgg ccctcagtat tcttgagagc acaataccct ggggaaagga 1560 ggagggcggt tcttaggaaa cagaagaata ggaacaggat aaatgggagc caggggagga 1620 agaaagattc ctcccgcaag gtggagaccc ctttcgggga gataagggct gcggagggat 1680 agggcagccc tgtgggggat gagttctaat gggaaacggg gagccatggg agggaaaaga 1740 ggtgatggga gatggatgag agcaaagggg gagagggttt ggagccgaca gataaggggc 1800 ctctgaggaa acgataaagg gagaactagg agagatgagg gggtcccggg aaaagataca 1860 gtgagaccca ggaggaagcc gagagagtcc cgggcagagg aggaactcag gagagtgaca 1920 ctgtgtccca gagcagctct agcaggcaac gtagggcaga ctggagtttc ccagtggcga 1980 gggcgaagcg ggtgcgcagg aggagaaggg gcaacactcg gggcttccgg tctcacactg 2040 gatgctcccc tcgacagcgg tcaagtcgga gcagccaact gcgtcctgga gagccgtgac 2100 gtcaccagct gtcgggcgga ttctcccctg tgagtgtgcc tgtgcggggc agccgcgggg 2160 ccaagtgctc gcgcctgtcc cggctgcgga tggcgctggg ccccgcgggg tgcagctcct 2220 tccgggataa acccgggcgc agtccttccg acgcccgggc tgccgtgcca cattcacctt 2280 catccggtgc gcctgaatcc gacgtgcacc gcccccctgc ggatggtccc aattcaccta 2340 cctgcccccc ttcccctctc tccgtctgat atttccttcc ccttctccca ggtaacgccc 2400 ccgagtcccc accagcactg attctggaac accctctccc ctacctgcga aacaccccgc 2460 tacccgacag aaaccaccat gcttgccccc tcagctctgc accgccctcc tcttagggat 2520 gccttcccca ggggtctccc ccacggacct gtcccttcct ctctacattc tttacggggg 2580 ctgactgccc tcatccatcc cactgtcccc ctaaggccgc acaacctgcc tgcgatacct 2640 cctgctcggc ctgctcctca cctgcctgct gttaggagtg accgccatct gcctgggagt 2700 gcgctgtgag taaggctgtc cctacttccc acttacaccg aacgactttc gtcccgactc 2760 tcgccccaaa gctgatcttc ctcagcaggg atcatagtca ctaccaccac ctccttagct 2820 ttgggtttgg tttccagcca gggctgttgt cttcctgaat tgtcctcacc taatccagta 2880 cctccttacg ctttagtcag tcaaatcttg tatttcagcc tgtgatgcaa aaagttgcaa 2940 gaagcagaaa aagggctgat gctgcagtct cagggcacta gggcagggct ggagaagaca 3000 tccaccaaat ctctgtggcc aaacagatct gcaggtgtct cagcagctcc agcagacgaa 3060 cagggttctg gaagtcacta acagcagcct gaggcagcag ctccgcctca agataacgca 3120 gctgggacag agtgcagagg atctgcaggg gtccaggaga gagctggcgc agagtcagga 3180 agcactacag gtggaacaga gggctcatca ggcggccgaa gggcagctac aggcctgcca 3240 ggcagacaga cagaagacga aggagacctt gcaaagtgag gagcaacaga ggagggcctt 3300 ggagcagaag ctcagcaaca tggagaacag actgaagccc ttcttcacat gcggctcagc 3360 aggtatccgc tctggggaga ggagagaagg gagggatgga ttggagcaag gagatagtag 3420 tggcaggagt ctatcaccca ggggcctgtg aggtgttcag gaaaggggca gtgaaccaga 3480 gtggtggctg acatgagctg ttcagccttg gccaacagag agaaggacca agctgcatac 3540 tagatttagg gggacccacc cagctccaag gctcatcctg taggggagag gaggatggca 3600 gccagcagtc caaatcacac tgtccaaggt ttcttccggt tgatatcagt gacaaggatc 3660 tgggaacgcc cccaaatttg gatgtgaact tggcggtttg cacatctttc taaagacagg 3720 gacctcattt ttaattaatt ttcgttatag ctgagatctc cccaaagggt aagacttact 3780 ggctgacatg gagtgcaacc tggtggtggg gcaaggcagt gaggcgctca gcactcacag 3840 tgaggtggag ctgggagttg tggtgtaggg aactaggatg gggtgtccat tcattttgaa 3900 agcaggaccc cagccagtaa gtggagatgg ggaaagggac acataaaaga ggcaaggctg 3960 tatcaaaaca ggctgagggt ctaggcaagg tgccacgcct aaggggactg gggtgttttt 4020 gggctatgat caggtttagg ctctgtctag cttgggggtc atctgggtga tctccccagg 4080 cataagacag caagattatg tgcagacacc ccttgccctc atttccttaa ccttaccctc 4140 tgtgaccatc agttccatgc cagggctgct cttagcagca tgttagacct ctacctccca 4200 aattgctctt ccggcgaaag gatggtttca gacttccact ctgatcacag ctgcccacca 4260 ctcctgcagg ttttccccac acttttttga tctcatcaag gcctccaatc tgctgactcc 4320 cctttttctc ccttcccatc agtccaggcc tccaatcaac tgattcccct ttttctctct 4380 tcccatcagc ttctcctgtc cacacttctc tccctaccta acatgggttt ggagcccatc 4440 acttcaacta ctcacttatt tttttaaata gagacaaggt ctcactatgt ttccctggcc 4500 ggtatcgaac tcctggactc aagcgatcct cctgcctcag cctcccaaag tgctgagatt 4560 acagtcgtga gccaccgcac ctggctcact cacttctttg ttccattatc cttctagtgt 4620 acctgatggg caaaatttga actctggatg aacccaactg cccatggtca gcatcaaggt 4680 gtgacaaatt gtggaatcat tctgctcagt gccactgaga ctctagtctc tcagcactgc 4740 ctggcaacct ttgcaatggt cagcttgctc tccatagtaa ctccttctcc ctcactttca 4800 gcagatgtcc tcatcttcca ctctttcaga gaggaattcc cttcactttc catcaccaag 4860 tctatatatc catatggatt tttgcccatc ctcatctttt gcaaggatat ctgcaacagc 4920 tttctaaatg tttttcctac tcactcctct ccctgcagtc cattcttcat attggagcca 4980 gggtaatctt ttacaaacaa aaatctgatc gtgtcaccca atctacagca ttccagttaa 5040 ttgattacat aaacagttat ggaatacata acatataata tgttctagca ctgtgcccag 5100 gcacttggga tatagctagt gaacaaaatt cctctatgta aaaccctaca ctgactttct 5160 cttcctcata atataaaaat aaaaatcctt aacatggctt ccatcactcc tgcctacctt 5220 tcagtgtgcc acttatctcc tcaccctctt ccatttggcc atacagattt cttttttttt 5280 ttttttgagg cggagtctcg ctctgtgaga ctgaagtgca ctcccaggct ggagtgcagt 5340 ggtgtgatct gggctcactg caacctctgc ctcctggatt caagctatcc ttttgcctca 5400 gcctcccgag taactgggat tataggcatg cgccatggtg cctgggtatt ttttggtatt 5460 tttagtagag gtggggtttc accatgttga ccaggctggt cttaaactcc tgacttcagg 5520 tgatccaccc cgcctcggcc tcccaaagtg ctgggattac agaagtgagc cattgtgccc 5580 agcagggatt tctttttatt cctttaagtt gccataggac agtagctcaa gggagtagca 5640 gggtggggac atgagcatat gtagagagaa gagagggagc tagtggagag ggaatgattt 5700 ttgagtgatg aaggtaggac tgatggaaca aagcaatgga gagaggtggc ctgggatcag 5760 aagtgcacat tgggatgtag gatgttgaag aaagagggat atttatttct tatggatagt 5820 caggaaaaga agaacagaga agagaaattt tgaggtggag aggagggaaa aggagagagc 5880 tcagtttctg caaaacagta cacaacattt tctgcttgta ggaaattgga gagggacgga 5940 gggggccagt gctggaggag tggaggatgt gcagtgaggg gtgctttagg gagtggagga 6000 gatgggccca agagatgtta ggtaaagtgt cagggtggac tgcaattgtt cagaaattgg 6060 ggtaataaaa tcagccctat taagacagac ctcccagatt gctaagaagt aatgttgttg 6120 aatgttaaag tctttttact aagtaaggac aggtatggtg tttccagggg atgcaaaaga 6180 aagattgcaa agatcataga agtccatgaa ggtgctgaga ttagctctgg gggaatattg 6240 ttagcaaaag attgctgagt agtcactttc atcccaactg ttgctgctat cacactcttt 6300 ctgcttttta gacacctgct gtctgtcggg atggataatg catcagaaaa gctgctttta 6360 catctcactt acttcaaaaa attggcagga gagccaaaaa caatgtgaaa ctctgtcttc 6420 caagctggcc acattcagtg aaatttatcc acaatcagta agcatatcct tactcacaga 6480 ctgcagtggg tactattagg taaagggaca tatgtcatca agtcaggaat ggcagtgcac 6540 acatggctgg ggtggccatt ccttttccat ggaagaaact gagaatcata atactcttga 6600 cctctgagct tggaaacaac ctctgcatct tctcagcaca gtggtccagg cagcccctag 6660 taactgatta gagttggcat gagaatggaa acttaaccat atctacattt tttttttttt 6720 gagacagagt ctcactctgt cacccaggct ggagtgcact tgcataatct cagctcactg 6780 caacctccgc ctcctgggtt catgcaattc tcccgtctca gcctcctgag tagctgggac 6840 tacagtcaca tgccaccacg cccagctaat ttttgtattt ttaatagagt cggggtttca 6900 ccatattggt caggctggtc tctaactcct ggcctcaggt gatccacctg cttcagtctc 6960 ccaaagtgct gggattatag gtgtgaacca ctgcgcccgg cctctacttt aatgtgtatc 7020 tcatctcagc ttgctggtca gctccctggg ctgcttgaca ggttcctgga ctcttgtggc 7080 ctctcaggca gagaggtccc acacctggtg ggctggattg tagggattag ttccttgggg 7140 ccagaacagc acacagaatg actcccctcc aaaggctggc agagattctt actcagctgg 7200 aaggcaaatt tagcagtgag cctgcatctg cagggtggca cttgatggat tcagggacaa 7260 gttaccaaag agctgtcata tatacagccc ccttgccaaa gctgaggcct ccctagggtg 7320 tgcattttct tccatatttc tcatcactct ccatgtatct ctggtctctc tgaatctccc 7380 ctcagcactc ttactacttc ttaaattcac tgttgccaaa tggtggttca gggaattcat 7440 attggactgg cctcagctct aacaaggatt ggaagttgac tgatgataca caacgcacta 7500 ggtaagtttg ttaggacttc tgggtatttt caactccata atcccctcca taaagacatg 7560 gtaatcagat ttccttccca cctggtcccc tcaggactct cccataaatc aacactgtaa 7620 ttgcctattt gtctgtttcc tctatttgat taataacttc ttagggcaat cttattctcc 7680 attctatttc cacctagaag tccctggcac atactttttt gtttcagtaa ctactaggtt 7740 gggacctacg ctatctttta gggggagaca ttaaggattt agttaaggct ccacagctga 7800 tgacagcagc gtttctccat ccttcgagaa ggccacccaa actgaggttg tactttctca 7860 gtagcggcca cttggggcac tagagctgcc tcagtgaagt acgcaaggga gcagcggggg 7920 taccatcgaa gctccacttc actgtgatca agacagcagt ggggtaccat tgaggatcaa 7980 ccctagggac cctgaaacag ggtggcaacc tccttatgtt cctcaattgg ccttcccatc 8040 tttttttttt ttttgagatg gagtctcgct ctgtcgccca gactggagtg cagtggtgca 8100 atctcagctc actgcaagct ccgcctcagg ggttcacacc attctcctgc ctcagcctcc 8160 ccagcagctg ggactacagg cgcacaccgc catgcccggc taattttttg tatttttagt 8220 agagacgggg tttcaccgtg ttagtcagga tggtctcgat gtcctgacct tgtgatccac 8280 ccgccttggc ctcccaaagt gctgggattg caggcatgag ccaccgtgcc cagcctggcc 8340 ttcccatctt taagatctcg agttattttt ccctctgctg gttctaaagt ctgttttagt 8400 tgaactccag acatctgtcc tgaaccatgc ccagatgtgg ccttaggcag gcagggtgag 8460 ttaggattta ggttttatac aggggaaggg gagagaaggg agggtgggat atgtccagag 8520 catatcccag ctctacctta ctacttttta ggacttatgc tcaaagctca aaatgtaaca 8580 aggtacataa aacttggtca tggtggacac tggagtcaga gtcatgtaga agttctcttc 8640 cctacatctg tgagatgaca gctttcaggt ttccagatta ggacagtcct ttgcactgag 8700 ttggtaagag tgagggatgg ggctgaggca tggggagtcc aggggcagag ggactcagag 8760 cagggactca gagcaggggc ccaggggccc gagagaaaac ttgtacctgg gatgaaagtt 8820 cctccacagc ctccagttgc ttggtctgct tgttagggtg gggtgggcag ggagggacag 8880 gtaaggccta caacttatgg gaggctggtg gtgtggctgt cgagagctga ggaggagatg 8940 gggtgagtgg gatgagggtt cagttaagtc aaccatggag ctcactgata ctttactttt 9000 ccagacactc atgccaacaa gaacctgtgc ccctccttcc taacctgagg cctggggttc 9060 ctcagaccat ctccttcatt ctgggcagtg cccagccacc ggctgaccca cacctgacac 9120 ttccagccag tctgctgcct gctccctctt cctgaaactg gactgttcct gggaaaaggg 9180 tgaagccacc tctagaaggg actttggcct ccccccaaga acttcccatg gtagaatggg 9240 gtgggggagg agggcgcacg ggctgagcgg ataggggcgg cccggagcca gccaggcagt 9300 tttattgaaa tctttttaaa taattgcacg tgttagtctc atgtgtcagc aatgctgtgt 9360 ctggttcagt gat 9373 5 1917 DNA Homo sapiens 5 ggccacccca gggtctggtc cctgacgacg cgcagtgagg gccccgccgc taccccagca 60 gtcgcctccc aagttcgcgg aacgcagctg accggctccc tctggactgg gtgacatgac 120 tgctcccaag cagtcgtttg taaactgagt ttctgtaaaa caattttatt tttcatatgt 180 gactgtagcg gggtatgatt tgaactttgt tttccgtccc ccagcccgga ttctctgtct 240 tctcctgtac agccgttccg ttttcttacc tcgtctccgt caccgaggcc ctcagccctg 300 aacacaagga ctgggcagtt tccctattga ttcctgaacc tggaacttaa gacatcttcc 360 gaggggcccc cccttgccac accctctagc tgatcgactc acaaatacct gtgatttctc 420 tccccgtctc cacctccagg cctgttcctg aacccccagt tactggaacc cccaggatga 480 tgagaagcga gggatgcaaa ccaggaaccc ccagtctcat ggggagaacg gagcctcacc 540 ctggagctgg ggagatcgtc tggacccagg ggcggaccca ggggctgatg atgatgggga 600 aatcacctac gagaatgttc aagtgcccgc agtcctaggg gtgccctcaa gcttggcttc 660 ttctgtacta ggggacaaag cagcggtcaa gtcggagcag ccaactgcgt cctggagagc 720 cgtgacgtca ccagctgtcg ggcggattct cccctgccgc acaacctgcc tgcgatacct 780 cctgctcggc ctgctcctca cctgcctgct gttaggagtg accgccatct gcctgggagt 840 gcgctatctg caggtgtctc agcagctcca gcagacgaac agggttctgg aagtcactaa 900 cagcagcctg aggcagcagc tccgcctcaa gataacgcag ctgggacaga gtgcagagga 960 tctgcagggg tccaggagag agctggcgca gagtcaggaa gcactacagg tggaacagag 1020 ggctcatcag gcggccgaag ggcagctaca ggcctgccag gcagacagac agaagacgaa 1080 ggagaccttg caaagtgagg agcaacagag gagggccttg gagcagaagc tcagcaacat 1140 ggagaacaga ctgaagccct tcttcacatg cggctcagca gacacctgct gtctgtcggg 1200 atggataatg catcagaaaa gctgctttta catctcactt acttcaaaaa attggcagga 1260 gagccaaaaa caatgtgaaa ctctgtcttc caagctggcc acattcagtg aaatttatcc 1320 acaatcacac tcttactact tcttaaattc actgttgcca aatggtggtt cagggaattc 1380 atattggact ggcctcagct ctaacaagga ttggaagttg actgatgata cacaacgcac 1440 taggacttat gctcaaagct caaaatgtaa caaggtacat aaaacttggt catggtggac 1500 actggagtca gagtcatgta gaagttctct tccctacatc tgtgagatga cagctttcag 1560 gtttccagat taggacagtc ctttgcactg agttgacact catgccaaca agaacctgtg 1620 cccctccttc ctaacctgag gcctggggtt cctcagacca tctccttcat tctgggcagt 1680 gcccagccac cggctgaccc acacctgaca cttccagcca gtctgctgcc tgctccctct 1740 tcctgaaact ggactgttcc tgggaaaagg gtgaagccac ctctagaagg gactttggcc 1800 tccccccaag aacttcccat ggtagaatgg ggtgggggag gagggcgcac gggctgagcg 1860 gataggggcg gcccggagcc agccaggcag ttttattgaa atctttttaa ataattg 1917 6 359 PRT Homo sapiens 6 Met Gln Thr Arg Asn Pro Gln Ser His Gly Glu Asn Gly Ala Ser Pro 1 5 10 15 Trp Ser Trp Gly Asp Arg Leu Asp Pro Gly Ala Asp Pro Gly Ala Asp 20 25 30 Asp Asp Gly Glu Ile Thr Tyr Glu Asn Val Gln Val Pro Ala Val Leu 35 40 45 Gly Val Pro Ser Ser Leu Ala Ser Ser Val Leu Gly Asp Lys Ala Ala 50 55 60 Val Lys Ser Glu Gln Pro Thr Ala Ser Trp Arg Ala Val Thr Ser Pro 65 70 75 80 Ala Val Gly Arg Ile Leu Pro Cys Arg Thr Thr Cys Leu Arg Tyr Leu 85 90 95 Leu Leu Gly Leu Leu Leu Thr Cys Leu Leu Leu Gly Val Thr Ala Ile 100 105 110 Cys Leu Gly Val Arg Tyr Leu Gln Val Ser Gln Gln Leu Gln Gln Thr 115 120 125 Asn Arg Val Leu Glu Val Thr Asn Ser Ser Leu Arg Gln Gln Leu Arg 130 135 140 Leu Lys Ile Thr Gln Leu Gly Gln Ser Ala Glu Asp Leu Gln Gly Ser 145 150 155 160 Arg Arg Glu Leu Ala Gln Ser Gln Glu Ala Leu Gln Val Glu Gln Arg 165 170 175 Ala His Gln Ala Ala Glu Gly Gln Leu Gln Ala Cys Gln Ala Asp Arg 180 185 190 Gln Lys Thr Lys Glu Thr Leu Gln Ser Glu Glu Gln Gln Arg Arg Ala 195 200 205 Leu Glu Gln Lys Leu Ser Asn Met Glu Asn Arg Leu Lys Pro Phe Phe 210 215 220 Thr Cys Gly Ser Ala Asp Thr Cys Cys Pro Ser Gly Trp Ile Met His 225 230 235 240 Gln Lys Ser Cys Phe Tyr Ile Ser Leu Thr Ser Lys Asn Trp Gln Glu 245 250 255 Ser Gln Lys Gln Cys Glu Thr Leu Ser Ser Lys Leu Ala Thr Phe Ser 260 265 270 Glu Ile Tyr Pro Gln Ser His Ser Tyr Tyr Phe Leu Asn Ser Leu Leu 275 280 285 Pro Asn Gly Gly Ser Gly Asn Ser Tyr Trp Thr Gly Leu Ser Ser Asn 290 295 300 Lys Asp Trp Lys Leu Thr Asp Asp Thr Gln Arg Thr Arg Thr Tyr Ala 305 310 315 320 Gln Ser Ser Lys Cys Asn Lys Val His Lys Thr Trp Ser Trp Trp Thr 325 330 335 Leu Glu Ser Glu Ser Cys Arg Ser Ser Leu Pro Tyr Ile Cys Glu Met 340 345 350 Thr Ala Phe Arg Phe Pro Asp 355 7 1387 DNA Homo sapiens 7 gctcgcgcct gtcccggctg cggatggcgc tgggccccgc ggggtgcagc tccttccggg 60 ataaacccgg gcgcagtcct tccgacgccc gggctgccgt gccacattca ccttcatccg 120 gtgcgcctga atccgacgtg caccgccccc ctgcggatgg tcccaattca cctacctgcc 180 ccccttcccc tctctccgtc tgatatttcc ttccccttct cccaggccgc acaacctgcc 240 tgcgatacct cctgctcggc ctgctcctca cctgcctgct gttaggagtg accgccatct 300 gcctgggagt gcgctatctg caggtgtctc agcagctcca gcagacgaac agggttctgg 360 aagtcactaa cagcagcctg aggcagcagc tccgcctcaa gataacgcag ctgggacaga 420 gtgcagagga tctgcagggg tccaggagag agctggcgca gagtcaggaa gcactacagg 480 tggaacagag ggctcatcag gcggccgaag ggcagctaca ggcctgccag gcagacagac 540 agaagacgaa ggagaccttg caaagtgagg agcaacagag gagggccttg gagcagaagc 600 tgagcaacat ggagaacaga ctgaagccct tcttcacatg cggctcagca gacacctgct 660 gtccgtcggg atggataatg catcagaaaa gctgctttta catctcactt acttcaaaaa 720 attggcagga gagccaaaaa caatgtgaaa ctctgtcttc caagctggcc acattcagtg 780 aaatttatcc acaatcacac tcttactact tcttaaattc actgttgcca aatggtggtt 840 cagggaattc atattggact ggcctcagct ctaacaagga ttggaagttg actgatgata 900 cacaacgcac taggacttat gctcaaagct caaaatgtaa caaggtacat aaaacttggt 960 catggtggac actggagtca gagtcatgta gaagttctct tccctacatc tgtgagatga 1020 cagctttcag gtttccagat taggacagtc ctttgcactg agttgacact catgccaaca 1080 agaacctgtg cccctccttc ctaacctgag gcctggggtt cctcagacca tctccttcat 1140 tctgggcagt gcccagccac cggctgaccc acacctgaca cttccagcca gtctgctgcc 1200 tgctccctct tcctgaaact ggactgttcc tgggaaaagg gtgaagccac ctctagaagg 1260 gactttggcc tccccccaag aacttcccat ggtagaatgg ggtgggggag gagggcgcac 1320 gggctgagcg gataggggcg gcccggagcc agccaggcag ttttattgaa atctttttaa 1380 ataattg 1387 8 42 PRT Homo sapiens 8 Gly Ser Arg Leu Ser Arg Leu Arg Met Ala Leu Gly Pro Ala Gly Ser 1 5 10 15 Ala Pro Ser Gly Ile Asn Pro Gly Ala Val Leu Pro Thr Pro Gly Leu 20 25 30 Pro Cys His Ile His Leu His Pro Val Arg 35 40 9 9592 DNA Homo sapiens 9 ctattggcag agctctaggg tccctggcag tgtcctgcca cacaccacca atcatccgag 60 ctgttggcat gtggtgagga ccttttcccc cttccttttt cctccccaca gctgcaggag 120 ggggattctg cggggcggat gtcactttgc taggggcggg ttcttcccac ccccacacta 180 catcttgttc tagctccttc ctatcccgga tgtaggccat tttggggttc ctagtagggg 240 tcacatcccg gtaaaagaac attagggagc acctttgggc acccctgctc agcagccccc 300 tttcccccaa ctggttccct ctctcctcca tcagccaggc ccccaccacc tatgagggcc 360 tcgcggtccc tgggtcctcc gccacaaatg cccccttttt ccttggggct gggtgggaag 420 ggacagcttt gggacttttt aaagaaagaa ctttgctagg aaagtaagag acgaaccaaa 480 accaaaccaa gaaaatgtca agacctccgc tggtccggaa tctgcgctcg gcctctggct 540 ctgttctgct cctctgcctt gacttcacct aggaagctgc agccttctta agccctctgc 600 aactcccttg ctcacctgct ccctggctcc ccatgagggt gcatgggggg cagtgctgcc 660 acttgccttt ccagaggtgc tgcagccatg gctcttcctt accttcacct ctggcccagg 720 cctgttcctg aacccccagt tactggaacc cccaggatga tgagaagcga gggatgcaaa 780 ccaggaaccc ccagtctcat ggggagaacg gagcctcacc ctggagctgg ggagatcgtc 840 tgggtcagtg cagctccagc tacacatcat aagccccagt ggacaaggca gccaggagga 900 gggtggggca gatcagggca gataaccaga aggaccgagt ttggggccca gcatggccca 960 tatgggttta agatgcatgg agaggaagca cagggaggac atggggacag gcccatcccc 1020 tacccctgtg ttctggacct cagcgtcccc tccgtggccc ctctggccct ggcccaggga 1080 tgccttctgc ttcattgagc ccatctgttt gagacttgct aacaccttcc tcttttgtga 1140 ctggaagtca cagggccaat tgctaagccg tgcagtcaca gagggaacac agagcctagt 1200 tgtaaacgga cagagacgag aggggcaagg gaggacagtg gatgacaggg aagacgagtg 1260 ggggcagagc tgctcaggac catggctgag gccatcacct atgcagatct gaggtttgtg 1320 aaggctcccc tgaagaagag catctccagc cggttaggac agggtaaggg ggatgaggcc 1380 tccccttgat cctgcatccc cacttcccgc cccacagatc ccattcccaa atcccacctt 1440 ggtccctttt ctcttctctc tagacccagg ggctgatgat gatggggaaa tcacctacga 1500 gaatgttcaa gtgcccgcag tcctaggggt gccctcaagc ttggcttctt ctgtactagg 1560 ggacaaagca ggtctggaga gcctggggga tgtgtgtttg ttggggggct gtgtcttgag 1620 ctggggagtc cacaatgttt ttgttgttgt tcttgagaca gggtcttgtt ctgtcactca 1680 ggctggagtg cagtggtgtg atcatggctc actgcagcct cgacctcctg ggctcaagtg 1740 atcctcctgc cctggcctcc caaagtgttg ggattgcagg cgtgcgcccc catgccatgg 1800 ccctcagtat tcttgagagc acaataccct ggggaaagga ggagggcggt tcttaggaaa 1860 cagaagaata ggaacaggat aaatgggagc caggggagga agaaagattc ctcccgcaag 1920 gtggagaccc ctttcgggga gataagggct gcggagggat agggcagccc tgtgggggat 1980 gagttctaat gggaaacggg gagccatggg agggaaaaga ggtgatggga gatggatgag 2040 agcaaagggg gagagggttt ggagccgaca gataaggggc ctctgaggaa acgataaagg 2100 gagaactagg agagatgagg gggtcccggg aaaagataca gtgagaccca ggaggaagcc 2160 gagagagtcc cgggcagagg aggaactcag gagagtgaca ctgtgtccca gagcagctct 2220 agcaggcaac gtagggcaga ctggagtttc ccagtggcga gggcgaagcg ggtgcgcagg 2280 aggagaaggg gcaacactcg gggcttccgg tctcacactg gatgctcccc tcgacagcgg 2340 tcaagtcgga gcagccaact gcgtcctgga gagccgtgac gtcaccagct gtcgggcgga 2400 ttctcccctg tgagtgtgcc tgtgcggggc agccgcgggg ccaagtgctc gcgcctgtcc 2460 cggctgcgga tggcgctggg ccccgcgggg tgcagctcct tccgggataa acccgggcgc 2520 agtccttccg acgcccgggc tgccgtgcca cattcacctt catccggtgc gcctgaatcc 2580 gacgtgcacc gcccccctgc ggatggtccc aattcaccta cctgcccccc ttcccctctc 2640 tccgtctgat atttccttcc ccttctccca ggtaacgccc ccgagtcccc accagcactg 2700 attctggaac accctctccc ctacctgcga aacaccccgc tacccgacag aaaccaccat 2760 gcttgccccc tcagctctgc accgccctcc tcttagggat gccttcccca ggggtctccc 2820 ccacggacct gtcccttcct ctctacattc tttacggggg ctgactgccc tcatccatcc 2880 cactgtcccc ctaaggccgc acaacctgcc tgcgatacct cctgctcggc ctgctcctca 2940 cctgcctgct gttaggagtg accgccatct gcctgggagt gcgctgtgag taaggctgac 3000 cctacttccc acttacaccg aacgactttc gtcccgactc tcgccccaaa gctgatcttc 3060 ctcagcaggg atcatagtca ctaccaccac ctccttagct ttgggtttgg tttccagcca 3120 gggctgttgt cttcctgaat tgtcctcacc taatccagta cctccttacg ctttagtcag 3180 tcaaatcttg tatttcagcc tgtgatgcaa aaagttgcaa gaagcagaaa aagggctgat 3240 gctgcagtct cagggcacta gggcagggct ggagaagaca tccaccaaat ctctgtggcc 3300 aaacagatct gcaggtgtct cagcagctcc agcagacgaa cagggttctg gaagtcacta 3360 acagcagcct gaggcagcag ctccgcctca agataacgca gctgggacag agtgcagagg 3420 atctgcaggg gtccaggaga gagctggcgc agagtcagga agcactacag gtggaacaga 3480 gggctcatca ggcggccgaa gggcagctac aggcctgcca ggcagacaga cagaagacga 3540 aggagacctt gcaaagtgag gagcaacaga ggagggcctt ggagcagaag ctgagcaaca 3600 tggagaacag actgaagccc ttcttcacat gcggctcagc aggtatccgc tctggggaga 3660 ggagagaagg gagggatgga ttggagcaag gagatagtag tggcaggagt ctatcaccca 3720 ggggcctgtg aggtgttcag gaaaggggca gtgaaccaga gtggtggctg acatgagctg 3780 ttcagccttg gccaacagag agaaggacca agctgcatac tagatttagg gggacccacc 3840 cagctccaag gctcatcctg taggggagag gaggatggca gccagcagtc caaatcacac 3900 tgtccaaggt ttcttccggt tgatatcagt gacaaggatc tgggaacgcc cccaaatttg 3960 gatgtgaact tggcggtttg cacatctttc taaagacagg gacctcattt ttaattaatt 4020 ttcgttatag ctgagatctc cccaaagggt aagacttact ggctgacatg gagtgcaacc 4080 tggtggtggg gcaaggcagt gaggcgctca gcactcacag tgaggtggag ctgggagttg 4140 tggtgtaggg aactaggatg gggtgtccat tcattttgaa agcaggaccc cagccagtaa 4200 gtggagatgg ggaaagggac acataaaaga ggcaaggctg tatcaaaaca ggctgagggt 4260 ctaggcaagg tgccacgcct aaggggactg gggtgttttt gggctatgat caggtttagg 4320 ctctgtctag cttgggggtc atctgggtga tctccccagg cataagacag caagattatg 4380 tgcagacacc ccttgccctc atttccttaa ccttaccctc tgtgaccatc agttccatgc 4440 cagggctgct cttagcagca tgttagacct ctacctccca aattgctctt ccggcgaaag 4500 gatggtttca gacttccact ctgatcacag ctgcccacca ctcctgcagg ttttccccac 4560 acttttttga tctcatcaag gcctccaatc tgctgactcc cctttttctc ccttcccatc 4620 agtccaggcc tccaatcaac tgattcccct ttttctctct tcccatcagc ttctcctgtc 4680 cacacttctc tccctaccta acatgggttt ggagcccatc acttcaacta ctcacttatt 4740 tttttaaata gagacaaggt ctcactatgt ttccctggcc ggtatcgaac tcctggactc 4800 aagcgatcct cctgcctcag cctcccaaag tgctgagatt acagtcgtga gccaccgcac 4860 ctggctcact cacttctttg ttccattatc cttctagtgt acctgatggg caaaatttga 4920 actctggatg aacccaactg cccatggtca gcatcaaggt gtgacaaatt gtggaatcat 4980 tctgctcagt gccactgaga ctctagtctc tcagcactgc ctggcaacct ttgcaatggt 5040 cagcttgctc tccatagtaa ctccttctcc ctcactttca gcagatgtcc tcatcttcca 5100 ctctttcaga gaggaattcc cttcactttc catcaccaag tctatatatc catatggatt 5160 tttgcccatc ctcatctttt gcaaggatat ctgcaacagc tttctaaatg tttttcctac 5220 tcactcctct ccctgcagtc cattcttcat attggagcca gggtaatctt ttacaaacaa 5280 aaatctgatc gtgtcaccca atctacagca ttccagttaa ttgattacat aaacagttat 5340 ggaatacata acatataata tgttctagca ctgtgcccag gcacttggga tatagctagt 5400 gaacaaaatt cctctatgta aaaccctaca ctgactttct cttcctcata atataaaaat 5460 aaaaatcctt aacatggctt ccatcactcc tgcctacctt tcagtgtgcc acttatctcc 5520 tcaccctctt ccatttggcc atacagattt cttttttttt ttttttgagg cggagtctcg 5580 ctctgtgaga ctgaagtgca ctcccaggct ggagtgcagt ggtgtgatct gggctcactg 5640 caacctctgc ctcctggatt caagctatcc ttttgcctca gcctcccgag taactgggat 5700 tataggcatg cgccatggtg cctgggtatt ttttggtatt tttagtagag gtggggtttc 5760 accatgttga ccaggctggt cttaaactcc tgacttcagg tgatccaccc cgcctcggcc 5820 tcccaaagtg ctgggattac agaagtgagc cattgtgccc agcagggatt tctttttatt 5880 cctttaagtt gccataggac agtagctcaa gggagtagca gggtggggac atgagcatat 5940 gtagagagaa gagagggagc tagtggagag ggaatgattt ttgagtgatg aaggtaggac 6000 tgatggaaca aagcaatgga gagaggtggc ctgggatcag aagtgcacat tgggatgtag 6060 gatgttgaag aaagagggat atttatttct tatggatagt caggaaaaga agaacagaga 6120 agagaaattt tgaggtggag aggagggaaa aggagagagc tcagtttctg caaaacagta 6180 cacaacattt tctgcttgta ggaaattgga gagggacgga gggggccagt gctggaggag 6240 tggaggatgt gcagtgaggg gtgctttagg gagtggagga gatgggccca agagatgtta 6300 ggtaaagtgt cagggtggac tgcaattgtt cagaaattgg ggtaataaaa tcagccctat 6360 taagacagac ctcccagatt gctaagaagt aatgttgttg aatgttaaag tctttttact 6420 aagtaaggac aggtatggtg tttccagggg atgcaaaaga aagattgcaa agatcataga 6480 agtccatgaa ggtgctgaga ttagctctgg gggaatattg ttagcaaaag attgctgagt 6540 agtcactttc atcccaactg ttgctgctat cacactcttt ctgcttttta gacacctgct 6600 gtccgtcggg atggataatg catcagaaaa gctgctttta catctcactt acttcaaaaa 6660 attggcagga gagccaaaaa caatgtgaaa ctctgtcttc caagctggcc acattcagtg 6720 aaatttatcc acaatcagta agcatatcct tactcacaga ctgcagtggg tactattagg 6780 taaagggaca tatgtcatca agtcaggaat ggcagtgcac acatggctgg ggtggccatt 6840 ccttttccat ggaagaaact gagaatcata atactcttga cctctgagct tggaaacaac 6900 ctctgcatct tctcagcaca gtggtccagg cagcccctag taactgatta gagttggcat 6960 gagaatggaa acttaaccat atctacattt tttttttttt gagacagagt ctcactctgt 7020 cacccaggct ggagtgcact tgcataatct cagctcactg caacctccgc ctcctgggtt 7080 catgcaattc tcccgtctca gcctcctgag tagctgggac tacagtcaca tgccaccacg 7140 cccagctaat ttttgtattt ttaatagagt cggggtttca ccatattggt caggctggtc 7200 tctaactcct ggcctcaggt gatccacctg cttcagtctc ccaaagtgct gggattatag 7260 gtgtgaacca ctgcgcccgg cctctacttt aatgtgtatc tcatctcagc ttgctggtca 7320 gctccctggg ctgcttgaca ggttcctgga ctcttgtggc ctctcaggca gagaggtccc 7380 acacctggtg ggctggattg tagggattag ttccttgggg ccagaacagc acacagaatg 7440 actcccctcc aaaggctggc agagattctt actcagctgg aaggcaaatt tagcagtgag 7500 cctgcatctg cagggtggca cttgatggat tcagggacaa gttaccaaag agctgtcata 7560 tatacagccc ccttgccaaa gctgaggcct ccctagggtg tgcattttct tccatatttc 7620 tcatcactct ccatgtatct ctggtctctc tgaatctccc ctcagcactc ttactacttc 7680 ttaaattcac tgttgccaaa tggtggttca gggaattcat attggactgg cctcagctct 7740 aacaaggatt ggaagttgac tgatgataca caacgcacta ggtaagtttg ttaggacttc 7800 tgggtatttt caactccata atcccctcca taaagacatg gtaatcagat ttccttccca 7860 cctggtcccc tcaggactct cccataaatc aacactgtaa ttgcctattt gtctgtttcc 7920 tctatttgat taataacttc ttagggcaat cttattctcc attctatttc cacctagaag 7980 tccctggcac atactttttt gtttcagtaa ctactaggtt gggacctacg ctatctttta 8040 gggggagaca ttaaggattt agttaaggct ccacagctga tgacagcagc gtttctccat 8100 ccttcgagaa ggccacccaa actgaggttg tactttctca gtagcggcca cttggggcac 8160 tagagctgcc tcagtgaagt acgcaaggga gcagcggggg taccatcgaa gctccacttc 8220 actgtgatca agacagcagt ggggtaccat tgaggatcaa ccctagggac cctgaaacag 8280 ggtggcaacc tccttatgtt cctcaattgg ccttcccatc tttttttttt ttttgagatg 8340 gagtctcgct ctgtcgccca gactggagtg cagtggtgca atctcagctc actgcaagct 8400 ccgcctcagg ggttcacacc attctcctgc ctcagcctcc ccagcagctg ggactacagg 8460 cgcacaccgc catgcccggc taattttttg tatttttagt agagacgggg tttcaccgtg 8520 ttagtcagga tggtctcgat gtcctgacct tgtgatccac ccgccttggc ctcccaaagt 8580 gctgggattg caggcatgag ccaccgtgcc cagcctggcc ttcccatctt taagatctcg 8640 agttattttt ccctctgctg gttctaaagt ctgttttagt tgaactccag acatctgtcc 8700 tgaaccatgc ccagatgtgg ccttaggcag gcagggtgag ttaggattta ggttttatac 8760 aggggaaggg gagagaaggg agggtgggat atgtccagag catatcccag ctctacctta 8820 ctacttttta ggacttatgc tcaaagctca aaatgtaaca aggtacataa aacttggtca 8880 tggtggacac tggagtcaga gtcatgtaga agttctcttc cctacatctg tgagatgaca 8940 gctttcaggt ttccagatta ggacagtcct ttgcactgag ttggtaagag tgagggatgg 9000 ggctgaggca tggggagtcc aggggcagag ggactcagag cagggactca gagcaggggc 9060 ccaggggccc gagagaaaac ttgtacctgg gatgaaagtt cctccacagc ctccagttgc 9120 ttggtctgct tgttagggtg gggtgggcag ggagggacag gtaaggccta caacttatgg 9180 gaggctggtg gtgtggctgt cgagagctga ggaggagatg gggtgagtgg gatgagggtt 9240 cagttaagtc aaccatggag ctcactgata ctttactttt ccagaccatc tccttcattc 9300 tgggcagtgc ccagccaccg gctgacccac acctgacact tccagccagt ctgctgcctg 9360 ctccctcttc ctgaaactgg actgttcctg ggaaaagggt gaagccacct ctagaaggga 9420 ctttggcctc cccccaagaa cttcccatgg tagaatgggg tgggggagga gggcgcacgg 9480 gctgagcgga taggggcggc ccggagccag ccaggcagtt ttattgaaat ctttttaaat 9540 aattgcacgt gttagtctca tgtgtcagca atgctgtgtc tggttcagtg at 9592 10 1551 DNA Homo sapiens misc_feature n is a or g or c or t/u, unknown, or other 10 ccgcagtcct aggggtgccc tcaagcttgg cttcttctgt actaggggac aaagcagcgg 60 tcaagtcgga gcagccaact gcgtcctgga gagccgtgac gtcaccagct gtcgggcgga 120 ttctcccctg ccgcacaacc tgcctgcgat acctcctgct cggcctgctc ctcacctgcc 180 tgctgttagg agtgaccgcc atctgcctgg gagtgcgctg tgagtaaggc tgaccctact 240 tcccacttac accgaacgac tttcgtcccg actctcgccc caaagctgat cttcctcagc 300 agggatcata gtcactacca ccacctcctt agctttgggt ttggtttcca gccagggctg 360 ttgtcttcct gaattgtcct cacctaatcc agtacctcct tacgctttag tcagtcaaat 420 cttgtatttc agcctgtgat gcaaaaagtt gcaagaagca gaaaaagggc tgatgctgca 480 gtctcagggc actagggcag ggctggagaa gacatccacc aaatctctgt ggccaaacag 540 atctgcaggt gtctcagcag ctccagcaga cgaacagggt tctggaagtc actaacagca 600 gcctgaggca gcagctccgc ctcaagataa cgcagctggg acagagtgca gaggatctgc 660 aggggtccag gagagagctg gcgcagagtc aggaagcact acaggtggaa cagagggctc 720 atcaggcggc cgaagggcag ctacaggcct gccaggcaga cagacagaag acgaaggaga 780 ccttgcaaag tgaggagcaa cagaggaggg ccttggagca gaagctgagc aacatggaga 840 acagactgaa gcccttcttc acatgcggct cagcagacac ctgctgtccg tcgggatgga 900 taatgcatca gaaaagctgc ttttacatct cacttacttc aaaaaattgg caggagagcc 960 aaaaacaatg tgaaactctg tcttccaagc tggccacatt cagtgaaatt tatccacaat 1020 cacactctta ctacttctta aattcactgt tgccaaatgg tggttcaggg aattcatatt 1080 ggactggcct cagctctaac aaggattgga agttgactga tgatacacaa cgcactagga 1140 cttatgctca aagctcaaaa tgtaacaagg tacataaaac ttggtcatgg tggacactgg 1200 agtcagagtc atgtagaagt tctcttccct acatctgtga gatgacagct ttcaggtttc 1260 cagattagga cagtcctttg cactgagttg accatctcct tcattctggg cagtgcccag 1320 ccaccggctg acccacacct gacacttcca gccagtctgc tgcctgctcc ctcttcctga 1380 aactggactg ttcctgggaa aagggtgaag ccacctctag aagggacttt ggcctccccc 1440 caagaacttc ccatggtaga atggggtggg ggaggagggc gcacgggctg agcggatagg 1500 ggcggcccgg agccagccag gcagttttat tgaaatcttt ttaaataatt g 1551 11 1802 DNA Homo sapiens 11 tggcagtcac agagggaaca cagagcctag ttgtaaacgg acagagacga gaggggcaag 60 ggaggacagt ggatgacagg gaagacgagt gggggcagag ctgctcagga ccatggctga 120 ggccatcacc tatgcagatc tgagacccag gggctgatga tgatggggaa atcacctacg 180 agaatgttca agtgcccgca gtcctagggg tgccctcaag cttggcttct tctgtactag 240 gggacaaagc agcggtcaag tcggagcagc caactgcgtc ctggagagcc gtgacgtcac 300 cagctgtcgg gcggattctc ccctgccgca caacctgcct gcgatacctc ctgctcggcc 360 tgctcctcac ctgcctgctg ttaggagtga ccgccatctg cctgggagtg cgctgtgagt 420 aaggctgtcc ctacttccca cttacaccga acgactttcg tcccgactct cgccccaaag 480 ctgatcttcc tcagcaggga tcatagtcac taccaccacc tccttagctt tgggtttggt 540 ttccagccag ggctgttgtc ttcctgaatt gtcctcacct aatccagtac ctccttacgc 600 tttagtcagt caaatcttgt atttcagcct gtgatgcaaa aagttgcaag aagcagaaaa 660 agggctgatg ctgcagtctc agggcactag ggcagggctg gagaagacat ccaccaaatc 720 tctgtggcca aacagatctg caggtgtctc agcagctcca gcagacgaac agggttctgg 780 aagtcactaa cagcagcctg aggcagcagc tccgcctcaa gataacgcag ctgggacaga 840 gtgcagagga tctgcagggg tccaggagag agctggcgca gagtcaggaa gcactacagg 900 tggaacagag ggctcatcag gcggccgaag ggcagctaca ggcctgccag gcagacagac 960 agaagacgaa ggagaccttg caaagtgagg agcaacagag gagggccttg gagcagaagc 1020 tgagcaacat ggagaacaga ctgaagccct tcttcacatg cggctcagca gacacctgct 1080 gtctgtcggg atggataatg catcagaaaa gctgctttta catctcactt acttcaaaaa 1140 attggcagga gagccaaaaa caatgtgaaa ctctgtcttc caagctggcc acattcagtg 1200 aaatttatcc acaatcacac tcttactact tcttaaattc actgttgcca aatggtggtt 1260 cagggaattc atattggact ggcctcagct ctaacaagga ttggaagttg actgatgata 1320 cacaacgcac taggacttat gctcaaagct caaaatgtaa caaggtacat aaaacttggt 1380 catggtggac actggagtca gagtcatgta gaagttctct tccctacatc tgtgagatga 1440 cagctttcag gtttccagat taggacagtc ctttgcactg agttgacact catgccaaca 1500 agaacctgtg cccctccttc ctaacctgag gcctggggtt cctcagacca tctccttcat 1560 tctgggcagt gcccagccac cggctgaccc acacctgaca cttccagcca gtctgctgcc 1620 tgctccctct tcctgaaact ggactgttcc tgggaaaagg gtgaagccac ctctagaagg 1680 gactttggcc tccccccaag aacttcccat ggtagaatgg ggtgggggag gagggcgcac 1740 gggctgagcg gataggggcg gcccggagcc agccaggcag ttttattgaa atctttttaa 1800 ac 1802 12 423 DNA Homo sapiens 12 aataccctcc gaccaccaca ccgacagctc tcgactcctc ctctacccca ctcaccctac 60 tcccaagtca attcagttgg tacctcgagt gactatgaaa tgaaaaggtc tgtgagtacg 120 gttgttcttg gacacgggga ggaaggattg gactccggac cccaaggagt ctggtagagg 180 aagtaagacc cgtcacgggt cggtggccga ctgggtgtgg actgtgaagg tcggtcagac 240 gacggacgag ggagaaggac tttgacctga caaggaccct tttcccactt cggtggagat 300 cttccctgaa accggagggg ggttcttgaa gggtaccatc ttaccccacc ccctcctccc 360 gcgtgcccga ctcgcctatc cccgccgggc ctcggtcggt ccgtcaaaat aactttagaa 420 aaa 423 13 293 DNA Homo sapiens 13 ggtaaacgga cagagacgag aggggcaagg gaggacagtg gatgacaggg aagacgagtg 60 ggggcagagc tgctcaggac catggttttt ttgtcatcac ctatgcagat ctgagaccca 120 ggggctgatg atgatgggga aatcacctac gagaatgttc aagtgcccgc agtcctaggg 180 gagccctcaa gcttggcttc ttctgtacta ggggacaaag cagcggtcaa gtcggagcag 240 ccaactgcgt cctggagagc cgcgacgtca ccagctgtcg ggcggattct cgc 293 14 288 DNA Homo sapiens 14 ggtaaacgga cagagacgag aggggcaagg gaggacagtg gatgacaggg aagacgagtg 60 ggggcagagc tgctcaggat catggctgag gccatcacct atgcagatct gagacccagg 120 ggctgatgat gatggggaaa tcacctacga gaatgttcaa gtgcccgcag tcctaggggt 180 gccctcaagc ttggcttctt ctgtactagg ggacaaagca gcggtcaagt cggagcagcc 240 aactgcgtcc tggagagccg tgacgtcacc agttgtcggg cggattgt 288 15 9655 DNA Homo sapiens 15 ctattggcag agctctaggg tccctggcag tgtcctgcca cacaccacca atcatccgag 60 ctgttggcat gtggtgagga ccttttcccc cttccttttt cctccccaca gctgcaggag 120 ggggattctg cggggcggat gtcactttgc taggggcggg ttcttcccac ccccacacta 180 catcttgttc tagctccttc ctatcccgga tgtaggccat tttggggttc ctagtagggg 240 tcacatcccg gtaaaagaac attagggagc acctttgggc acccctgctc agcagccccc 300 tttcccccaa ctggttccct ctctcctcca tcagccaggc ccccaccacc tatgagggcc 360 tcgcggtccc tgggtcctcc gccacaaatg cccccttttt ccttggggct gggtgggaag 420 ggacagcttt gggacttttt aaagaaagaa ctttgctagg aaagtaagag acgaaccaaa 480 accaaaccaa gaaaatgtca agacctccgc tggtccggaa tctgcgctcg gcctctggct 540 ctgttctgct cctctgcctt gacttcacct aggaagctgc agccttctta agccctctgc 600 aactcccttg ctcacctgct ccctggctcc ccatgagggt gcatgggggg cagtgctgcc 660 acttgccttt ccagaggtgc tgcagccatg gctcttcctt accttcacct ctggcccagg 720 cctgttcctg aacccccagt tactggaacc cccaggatga tgagaagcga gggatgcaaa 780 ccaggaaccc ccagtctcat ggggagaacg gagcctcacc ctggagctgg ggagatcgtc 840 tgggtcagtg cagctccagc tacacatcat aagccccagt ggacaaggca gccaggagga 900 gggtggggca gatcagggca gataaccaga aggaccgagt ttggggccca gcatggccca 960 tatgggttta agatgcatgg agaggaagca cagggaggac atggggacag gcccatcccc 1020 tacccctgtg ttctggacct cagcgtcccc tccgtggccc ctctggccct ggcccaggga 1080 tgccttctgc ttcattgagc ccatctgttt gagacttgct aacaccttcc tcttttgtga 1140 ctggaagtca cagggccaat tgctaagccg tgcagtcaca gagggaacac agagcctagt 1200 tgtaaacgga cagagacgag aggggcaagg gaggacagtg gatgacaggg aagacgagtg 1260 ggggcagagc tgctcaggac catggttttt ttgtcatcac ctatgcagat ctgaggtttg 1320 tgaaggctcc cctgaagaag agcatctcca gccggttagg acagggtaag ggggatgagg 1380 cctccccttg atcctgcatc cccacttccc gccccacaga tcccattccc aaatcccacc 1440 ttggtccctt ttctcttctc tctagaccca ggggctgatg atgatgggga aatcacctac 1500 gagaatgttc aagtgcccgc agtcctaggg gtgccctcaa gcttggcttc ttctgtacta 1560 ggggacaaag caggtctgga gagcctgggg gatgtgtgtt tgttgggggg ctgtgtcttg 1620 agctggggag tccacaatgt ttttgttgtt gttcttgaga cagggtcttg ttctgtcact 1680 caggctggag tgcagtggtg tgatcatggc tcactgcagc ctcgacctcc tgggctcaag 1740 tgatcctcct gccctggcct cccaaagtgt tgggattgca ggcgtgcgcc cccatgccat 1800 ggccctcagt attcttgaga gcacaatacc ctggggaaag gaggagggcg gttcttagga 1860 aacagaagaa taggaacagg ataaatggga gccaggggag gaagaaagat tcctcccgca 1920 aggtggagac ccctttcggg gagataaggg ctgcggaggg atagggcagc cctgtggggg 1980 atgagttcta atgggaaacg gggagccatg ggagggaaaa gaggtgatgg gagatggatg 2040 agagcaaagg gggagagggt ttggagccga cagataaggg gcctctgagg aaacgataaa 2100 gggagaacta ggagagatga gggggtcccg ggaaaagata cagtgagacc caggaggaag 2160 ccgagagagt cccgggcaga ggaggaactc aggagagtga cactgtgtcc cagagcagct 2220 ctagcaggca acgtagggca gactggagtt tcccagtggc gagggcgaag cgggtgcgca 2280 ggaggagaag gggcaacact cggggcttcc ggtctcacac tggatgctcc cctcgacagc 2340 ggtcaagtcg gagcagccaa ctgcgtcctg gagagccgtg acgtcaccag ctgtcgggcg 2400 gattctcccc tgtgagtgtg cctgtgcggg gcagccgcgg ggccaagtgc tcgcgcctgt 2460 cccggctgcg gatggcgctg ggccccgcgg ggtgcagctc cttccgggat aaacccgggc 2520 gcagtccttc cgacgcccgg gctgccgtgc cacattcacc ttcatccggt gcgcctgaat 2580 ccgacgtgca ccgcccccct gcggatggtc ccaattcacc tacctgcccc ccttcccctc 2640 tctccgtctg atatttcctt ccccttctcc caggtaacgc ccccgagtcc ccaccagcac 2700 tgattctgga acaccctctc ccctacctgc gaaacacccc gctacccgac agaaaccacc 2760 atgcttgccc cctcagctct gcaccgccct cctcttaggg atgccttccc caggggtctc 2820 ccccacggac ctgtcccttc ctctctacat tctttacggg ggctgactgc cctcatccat 2880 cccactgtcc ccctaaggcc gcacaacctg cctgcgatac ctcctgctcg gcctgctcct 2940 cacctgcctg ctgttaggag tgaccgccat ctgcctggga gtgcgctgtg agtaaggctg 3000 tccctacttc ccacttacac cgaacgactt tcgtcccgac tctcgcccca aagctgatct 3060 tcctcagcag ggatcatagt cactaccacc acctccttag ctttgggttt ggtttccagc 3120 cagggctgtt gtcttcctga attgtcctca cctaatccag tacctcctta cgctttagtc 3180 agtcaaatct tgtatttcag cctgtgatgc aaaaagttgc aagaagcaga aaaagggctg 3240 atgctgcagt ctcagggcac tagggcaggg ctggagaaga catccaccaa atctctgtgg 3300 ccaaacagat ctgcaggtgt ctcagcagct ccagcagacg aacagggttc tggaagtcac 3360 taacagcagc ctgaggcagc agctccgcct caagataacg cagctgggac agagtgcaga 3420 ggatctgcag gggtccagga gagagctggc gcagagtcag gaagcactac aggtggaaca 3480 gagggctcat caggcggccg aagggcagct acaggcctgc caggcagaca gacagaagac 3540 gaaggagacc ttgcaaagtg aggagcaaca gaggagggcc ttggagcaga agctgagcaa 3600 catggagaac agactgaagc ccttcttcac atgcggctca gcaggtatcc gctctgggga 3660 gaggagagaa gggagggatg gattggagca aggagatagt agtggcagga gtctatcacc 3720 caggggcctg tgaggtgttc aggaaagggg cagtgaacca gagtggtggc tgacatgagc 3780 tgttcagcct tggccaacag agagaaggac caagctgcat actagattta gggggaccca 3840 cccagctcca aggctcatcc tgtaggggag aggaggatgg cagccagcag tccaaatcac 3900 actgtccaag gtttcttccg gttgatatca gtgacaagga tctgggaacg cccccaaatt 3960 tggatgtgaa cttggcggtt tgcacatctt tctaaagaca gggacctcat ttttaattaa 4020 ttttcgttat agctgagatc tccccaaagg gtaagactta ctggctgaca tggagtgcaa 4080 cctggtggtg gggcaaggca gtgaggcgct cagcactcac agtgaggtgg agctgggagt 4140 tgtggtgtag ggaactagga tggggtgtcc attcattttg aaagcaggac cccagccagt 4200 aagtggagat ggggaaaggg acacataaaa gaggcaaggc tgtatcaaaa caggctgagg 4260 gtctaggcaa ggtgccacgc ctaaggggac tggggtgttt ttgggctatg atcaggttta 4320 ggctctgtct agcttggggg tcatctgggt gatctcccca ggcataagac agcaagatta 4380 tgtgcagaca ccccttgccc tcatttcctt aaccttaccc tctgtgacca tcagttccat 4440 gccagggctg ctcttagcag catgttagac ctctacctcc caaattgctc ttccggcgaa 4500 aggatggttt cagacttcca ctctgatcac agctgcccac cactcctgca ggttttcccc 4560 acactttttt gatctcatca aggcctccaa tctgctgact cccctttttc tcccttccca 4620 tcagtccagg cctccaatca actgattccc ctttttctct cttcccatca gcttctcctg 4680 tccacacttc tctccctacc taacatgggt ttggagccca tcacttcaac tactcactta 4740 tttttttaaa tagagacaag gtctcactat gtttccctgg ccggtatcga actcctggac 4800 tcaagcgatc ctcctgcctc agcctcccaa agtgctgaga ttacagtcgt gagccaccgc 4860 acctggctca ctcacttctt tgttccatta tccttctagt gtacctgatg ggcaaaattt 4920 gaactctgga tgaacccaac tgcccatggt cagcatcaag gtgtgacaaa ttgtggaatc 4980 attctgctca gtgccactga gactctagtc tctcagcact gcctggcaac ctttgcaatg 5040 gtcagcttgc tctccatagt aactccttct ccctcacttt cagcagatgt cctcatcttc 5100 cactctttca gagaggaatt cccttcactt tccatcacca agtctatata tccatatgga 5160 tttttgccca tcctcatctt ttgcaaggat atctgcaaca gctttctaaa tgtttttcct 5220 actcactcct ctccctgcag tccattcttc atattggagc cagggtaatc ttttacaaac 5280 aaaaatctga tcgtgtcacc caatctacag cattccagtt aattgattac ataaacagtt 5340 atggaataca taacatataa tatgttctag cactgtgccc aggcacttgg gatatagcta 5400 gtgaacaaaa ttcctctatg taaaacccta cactgacttt ctcttcctca taatataaaa 5460 ataaaaatcc ttaacatggc ttccatcact cctgcctacc tttcagtgtg ccacttatct 5520 cctcaccctc ttccatttgg ccatacagat ttcttttttt ttttttttga ggcggagtct 5580 cgctctgtga gactgaagtg cactcccagg ctggagtgca gtggtgtgat ctgggctcac 5640 tgcaacctct gcctcctgga ttcaagctat ccttttgcct cagcctcccg agtaactggg 5700 attataggca tgcgccatgg tgcctgggta ttttttggta tttttagtag aggtggggtt 5760 tcaccatgtt gaccaggctg gtcttaaact cctgacttca ggtgatccac cccgcctcgg 5820 cctcccaaag tgctgggatt acagaagtga gccattgtgc ccagcaggga tttcttttta 5880 ttcctttaag ttgccatagg acagtagctc aagggagtag cagggtgggg acatgagcat 5940 atgtagagag aagagaggga gctagtggag agggaatgat ttttgagtga tgaaggtagg 6000 actgatggaa caaagcaatg gagagaggtg gcctgggatc agaagtgcac attgggatgt 6060 aggatgttga agaaagaggg atatttattt cttatggata gtcaggaaaa gaagaacaga 6120 gaagagaaat tttgaggtgg agaggaggga aaaggagaga gctcagtttc tgcaaaacag 6180 tacacaacat tttctgcttg taggaaattg gagagggacg gagggggcca gtgctggagg 6240 agtggaggat gtgcagtgag gggtgcttta gggagtggag gagatgggcc caagagatgt 6300 taggtaaagt gtcagggtgg actgcaattg ttcagaaatt ggggtaataa aatcagccct 6360 attaagacag acctcccaga ttgctaagaa gtaatgttgt tgaatgttaa agtcttttta 6420 ctaagtaagg acaggtatgg tgtttccagg ggatgcaaaa gaaagattgc aaagatcata 6480 gaagtccatg aaggtgctga gattagctct gggggaatat tgttagcaaa agattgctga 6540 gtagtcactt tcatcccaac tgttgctgct atcacactct ttctgctttt tagacacctg 6600 ctgtccgtcg ggatggataa tgcatcagaa aagctgcttt tacatctcac ttacttcaaa 6660 aaattggcag gagagccaaa aacaatgtga aactctgtct tccaagctgg ccacattcag 6720 tgaaatttat ccacaatcag taagcatatc cttactcaca gactgcagtg ggtactatta 6780 ggtaaaggga catatgtcat caagtcagga atggcagtgc acacatggct ggggtggcca 6840 ttccttttcc atggaagaaa ctgagaatca taatactctt gacctctgag cttggaaaca 6900 acctctgcat cttctcagca cagtggtcca ggcagcccct agtaactgat tagagttggc 6960 atgagaatgg aaacttaacc atatctacat tttttttttt ttgagacaga gtctcactct 7020 gtcacccagg ctggagtgca cttgcataat ctcagctcac tgcaacctcc gcctcctggg 7080 ttcatgcaat tctcccgtct cagcctcctg agtagctggg actacagtca catgccacca 7140 cgcccagcta atttttgtat ttttaataga gtcggggttt caccatattg gtcaggctgg 7200 tctctaactc ctggcctcag gtgatccacc tgcttcagtc tcccaaagtg ctgggattat 7260 aggtgtgaac cactgcgccc ggcctctact ttaatgtgta tctcatctca gcttgctggt 7320 cagctccctg ggctgcttga caggttcctg gactcttgtg gcctctcagg cagagaggtc 7380 ccacacctgg tgggctggat tgtagggatt agttccttgg ggccagaaca gcacacagaa 7440 tgactcccct ccaaaggctg gcagagattc ttactcagct ggaaggcaaa tttagcagtg 7500 agcctgcatc tgcagggtgg cacttgatgg attcagggac aagttaccaa agagctgtca 7560 tatatacagc ccccttgcca aagctgaggc ctccctaggg tgtgcatttt cttccatatt 7620 tctcatcact ctccatgtat ctctggtctc tctgaatctc ccctcagcac tcttactact 7680 tcttaaattc actgttgcca aatggtggtt cagggaattc atattggact ggcctcagct 7740 ctaacaagga ttggaagttg actgatgata cacaacgcac taggtaagtt tgttaggact 7800 tctgggtatt ttcaactcca taatcccctc cataaagaca tggtaatcag atttccttcc 7860 cacctggtcc cctcaggact ctcccataaa tcaacactgt aattgcctat ttgtctgttt 7920 cctctatttg attaataact tcttagggca atcttattct ccattctatt tccacctaga 7980 agtccctggc acatactttt ttgtttcagt aactactagg ttgggaccta cgctatcttt 8040 tagggggaga cattaaggat ttagttaagg ctccacagct gatgacagca gcgtttctcc 8100 atccttcgag aaggccaccc aaactgaggt tgtactttct cagtagcggc cacttggggc 8160 actagagctg cctcagtgaa gtacgcaagg gagcagcggg ggtaccatcg aagctccact 8220 tcactgtgat caagacagca gtggggtacc attgaggatc aaccctaggg accctgaaac 8280 agggtggcaa cctccttatg ttcctcaatt ggccttccca tctttttttt ttttttgaga 8340 tggagtctcg ctctgtcgcc cagactggag tgcagtggtg caatctcagc tcactgcaag 8400 ctccgcctca ggggttcaca ccattctcct gcctcagcct ccccagcagc tgggactaca 8460 ggcgcacacc gccatgcccg gctaattttt tgtattttta gtagagacgg ggtttcaccg 8520 tgttagtcag gatggtctcg atgtcctgac cttgtgatcc acccgccttg gcctcccaaa 8580 gtgctgggat tgcaggcatg agccaccgtg cccagcctgg ccttcccatc tttaagatct 8640 cgagttattt ttccctctgc tggttctaaa gtctgtttta gttgaactcc agacatctgt 8700 cctgaaccat gcccagatgt ggccttaggc aggcagggtg agttaggatt taggttttat 8760 acaggggaag gggagagaag ggagggtggg atatgtccag agcatatccc agctctacct 8820 tactactttt taggacttat gctcaaagct caaaatgtaa caaggtacat aaaacttggt 8880 catggtggac actggagtca gagtcatgta gaagttctct tccctacatc tgtgagatga 8940 cagctttcag gtttccagat taggacagtc ctttgcactg agttggtaag agtgagggat 9000 ggggctgagg catggggagt ccaggggcag agggactcag agcagggact cagagcaggg 9060 gcccaggggc ccgagagaaa acttgtacct gggatgaaag ttcctccaca gcctccagtt 9120 gcttggtctg cttgttaggg tggggtgggc agggagggac aggtaaggcc tacaacttat 9180 gggaggctgg tggtgtggct gtcgagagct gaggaggaga tggggtgagt gggatgaggg 9240 ttcagttaag tcaaccatgg agctcactga tactttactt ttccagacac tcatgccaac 9300 aagaacctgt gcccctcctt cctaacctga ggcctggggt tcctcagacc atctccttca 9360 ttctgggcag tgcccagcca ccggctgacc cacacctgac acttccagcc agtctgctgc 9420 ctgctccctc ttcctgaaac tggactgttc ctgggaaaag ggtgaagcca cctctagaag 9480 ggactttggc ctccccccaa gaacttccca tggtagaatg gggtggggga ggagggcgca 9540 cgggctgagc ggataggggc ggcccggagc cagccaggca gttttattga aatcttttta 9600 aataattgca cgtgttagtc tcatgtgtca gcaatgctgt gtctggttca gtgat 9655 16 14 PRT Homo sapiens 16 Met Ala Glu Ala Ile Thr Tyr Ala Asp Leu Arg Phe Val Lys 1 5 10 17 613 DNA Homo sapiens 17 aaattggcag gagagccaaa aacaatgtga aactctgtct tccaagctgg ccacattcag 60 tgaaatttat ccacaatcac actcttacta cttcttaaat tcactgttgc caaatggtgg 120 ttcagggaat tcatattgga ctggcctcag ctctaacaag gattggaagt tgactgatga 180 tacacaacgc actaggactt atgctcaaag ctcaaaatgt aacaaggtac ataaaacttg 240 gtcatggtgg acactggagt cagagtcatg tagaagttct cttccctaca tctgtgagat 300 gacagctttc aggtttccag attaggacag tcctttgcac tgagttgaca ctcatgccaa 360 caagaacctg tgcccctcct tcctaacctg acacttccag ccagtctgct gcctgctccc 420 tcttcctgaa actggactgt tcctgggaaa agggtgaagc cacctctaga agggactttg 480 gcctcccccc aagaacttcc catggtagaa tggggtgggg gaggagggcg cacgggctga 540 gcggataggg gcggcccgga gccagccagg cagttttatt gaaatctttt taaataataa 600 aaaaaaaaaa aaa 613 18 9584 DNA Homo sapiens 18 ctattggcag agctctaggg tccctggcag tgtcctgcca cacaccacca atcatccgag 60 ctgttggcat gtggtgagga ccttttcccc cttccttttt cctccccaca gctgcaggag 120 ggggattctg cggggcggat gtcactttgc taggggcggg ttcttcccac ccccacacta 180 catcttgttc tagctccttc ctatcccgga tgtaggccat tttggggttc ctagtagggg 240 tcacatcccg gtaaaagaac attagggagc acctttgggc acccctgctc agcagccccc 300 tttcccccaa ctggttccct ctctcctcca tcagccaggc ccccaccacc tatgagggcc 360 tcgcggtccc tgggtcctcc gccacaaatg cccccttttt ccttggggct gggtgggaag 420 ggacagcttt gggacttttt aaagaaagaa ctttgctagg aaagtaagag acgaaccaaa 480 accaaaccaa gaaaatgtca agacctccgc tggtccggaa tctgcgctcg gcctctggct 540 ctgttctgct cctctgcctt gacttcacct aggaagctgc agccttctta agccctctgc 600 aactcccttg ctcacctgct ccctggctcc ccatgagggt gcatgggggg cagtgctgcc 660 acttgccttt ccagaggtgc tgcagccatg gctcttcctt accttcacct ctggcccagg 720 cctgttcctg aacccccagt tactggaacc cccaggatga tgagaagcga gggatgcaaa 780 ccaggaaccc ccagtctcat ggggagaacg gagcctcacc ctggagctgg ggagatcgtc 840 tgggtcagtg cagctccagc tacacatcat aagccccagt ggacaaggca gccaggagga 900 gggtggggca gatcagggca gataaccaga aggaccgagt ttggggccca gcatggccca 960 tatgggttta agatgcatgg agaggaagca cagggaggac atggggacag gcccatcccc 1020 tacccctgtg ttctggacct cagcgtcccc tccgtggccc ctctggccct ggcccaggga 1080 tgccttctgc ttcattgagc ccatctgttt gagacttgct aacaccttcc tcttttgtga 1140 ctggaagtca cagggccaat tgctaagccg tgcagtcaca gagggaacac agagcctagt 1200 tgtaaacgga cagagacgag aggggcaagg gaggacagtg gatgacaggg aagacgagtg 1260 ggggcagagc tgctcaggac catggctgag gccatcacct atgcagatct gaggtttgtg 1320 aaggctcccc tgaagaagag catctccagc cggttaggac agggtaaggg ggatgaggcc 1380 tccccttgat cctgcatccc cacttcccgc cccacagatc ccattcccaa atcccacctt 1440 ggtccctttt ctcttctctc tagacccagg ggctgatgat gatggggaaa tcacctacga 1500 gaatgttcaa gtgcccgcag tcctaggggt gccctcaagc ttggcttctt ctgtactagg 1560 ggacaaagca ggtctggaga gcctggggga tgtgtgtttg ttggggggct gtgtcttgag 1620 ctggggagtc cacaatgttt ttgttgttgt tcttgagaca gggtcttgtt ctgtcactca 1680 ggctggagtg cagtggtgtg atcatggctc actgcagcct cgacctcctg ggctcaagtg 1740 atcctcctgc cctggcctcc caaagtgttg ggattgcagg cgtgcgcccc catgccatgg 1800 ccctcagtat tcttgagagc acaataccct ggggaaagga ggagggcggt tcttaggaaa 1860 cagaagaata ggaacaggat aaatgggagc caggggagga agaaagattc ctcccgcaag 1920 gtggagaccc ctttcgggga gataagggct gcggagggat agggcagccc tgtgggggat 1980 gagttctaat gggaaacggg gagccatggg agggaaaaga ggtgatggga gatggatgag 2040 agcaaagggg gagagggttt ggagccgaca gataaggggc ctctgaggaa acgataaagg 2100 gagaactagg agagatgagg gggtcccggg aaaagataca gtgagaccca ggaggaagcc 2160 gagagagtcc cgggcagagg aggaactcag gagagtgaca ctgtgtccca gagcagctct 2220 agcaggcaac gtagggcaga ctggagtttc ccagtggcga gggcgaagcg ggtgcgcagg 2280 aggagaaggg gcaacactcg gggcttccgg tctcacactg gatgctcccc tcgacagcgg 2340 tcaagtcgga gcagccaact gcgtcctgga gagccgtgac gtcaccagct gtcgggcgga 2400 ttctcccctg tgagtgtgcc tgtgcggggc agccgcgggg ccaagtgctc gcgcctgtcc 2460 cggctgcgga tggcgctggg ccccgcgggg tgcagctcct tccgggataa acccgggcgc 2520 agtccttccg acgcccgggc tgccgtgcca cattcacctt catccggtgc gcctgaatcc 2580 gacgtgcacc gcccccctgc ggatggtccc aattcaccta cctgcccccc ttcccctctc 2640 tccgtctgat atttccttcc ccttctccca ggtaacgccc ccgagtcccc accagcactg 2700 attctggaac accctctccc ctacctgcga aacaccccgc tacccgacag aaaccaccat 2760 gcttgccccc tcagctctgc accgccctcc tcttagggat gccttcccca ggggtctccc 2820 ccacggacct gtcccttcct ctctacattc tttacggggg ctgactgccc tcatccatcc 2880 cactgtcccc ctaaggccgc acaacctgcc tgcgatacct cctgctcggc ctgctcctca 2940 cctgcctgct gttaggagtg accgccatct gcctgggagt gcgctgtgag taaggctgtc 3000 cctacttccc acttacaccg aacgactttc gtcccgactc tcgccccaaa gctgatcttc 3060 ctcagcaggg atcatagtca ctaccaccac ctccttagct ttgggtttgg tttccagcca 3120 gggctgttgt cttcctgaat tgtcctcacc taatccagta cctccttacg ctttagtcag 3180 tcaaatcttg tatttcagcc tgtgatgcaa aaagttgcaa gaagcagaaa aagggctgat 3240 gctgcagtct cagggcacta gggcagggct ggagaagaca tccaccaaat ctctgtggcc 3300 aaacagatct gcaggtgtct cagcagctcc agcagacgaa cagggttctg gaagtcacta 3360 acagcagcct gaggcagcag ctccgcctca agataacgca gctgggacag agtgcagagg 3420 atctgcaggg gtccaggaga gagctggcgc agagtcagga agcactacag gtggaacaga 3480 gggctcatca ggcggccgaa gggcagctac aggcctgcca ggcagacaga cagaagacga 3540 aggagacctt gcaaagtgag gagcaacaga ggagggcctt ggagcagaag ctgagcaaca 3600 tggagaacag actgaagccc ttcttcacat gcggctcagc aggtatccgc tctggggaga 3660 ggagagaagg gagggatgga ttggagcaag gagatagtag tggcaggagt ctatcaccca 3720 ggggcctgtg aggtgttcag gaaaggggca gtgaaccaga gtggtggctg acatgagctg 3780 ttcagccttg gccaacagag agaaggacca agctgcatac tagatttagg gggacccacc 3840 cagctccaag gctcatcctg taggggagag gaggatggca gccagcagtc caaatcacac 3900 tgtccaaggt ttcttccggt tgatatcagt gacaaggatc tgggaacgcc cccaaatttg 3960 gatgtgaact tggcggtttg cacatctttc taaagacagg gacctcattt ttaattaatt 4020 ttcgttatag ctgagatctc cccaaagggt aagacttact ggctgacatg gagtgcaacc 4080 tggtggtggg gcaaggcagt gaggcgctca gcactcacag tgaggtggag ctgggagttg 4140 tggtgtaggg aactaggatg gggtgtccat tcattttgaa agcaggaccc cagccagtaa 4200 gtggagatgg ggaaagggac acataaaaga ggcaaggctg tatcaaaaca ggctgagggt 4260 ctaggcaagg tgccacgcct aaggggactg gggtgttttt gggctatgat caggtttagg 4320 ctctgtctag cttgggggtc atctgggtga tctccccagg cataagacag caagattatg 4380 tgcagacacc ccttgccctc atttccttaa ccttaccctc tgtgaccatc agttccatgc 4440 cagggctgct cttagcagca tgttagacct ctacctccca aattgctctt ccggcgaaag 4500 gatggtttca gacttccact ctgatcacag ctgcccacca ctcctgcagg ttttccccac 4560 acttttttga tctcatcaag gcctccaatc tgctgactcc cctttttctc ccttcccatc 4620 agtccaggcc tccaatcaac tgattcccct ttttctctct tcccatcagc ttctcctgtc 4680 cacacttctc tccctaccta acatgggttt ggagcccatc acttcaacta ctcacttatt 4740 tttttaaata gagacaaggt ctcactatgt ttccctggcc ggtatcgaac tcctggactc 4800 aagcgatcct cctgcctcag cctcccaaag tgctgagatt acagtcgtga gccaccgcac 4860 ctggctcact cacttctttg ttccattatc cttctagtgt acctgatggg caaaatttga 4920 actctggatg aacccaactg cccatggtca gcatcaaggt gtgacaaatt gtggaatcat 4980 tctgctcagt gccactgaga ctctagtctc tcagcactgc ctggcaacct ttgcaatggt 5040 cagcttgctc tccatagtaa ctccttctcc ctcactttca gcagatgtcc tcatcttcca 5100 ctctttcaga gaggaattcc cttcactttc catcaccaag tctatatatc catatggatt 5160 tttgcccatc ctcatctttt gcaaggatat ctgcaacagc tttctaaatg tttttcctac 5220 tcactcctct ccctgcagtc cattcttcat attggagcca gggtaatctt ttacaaacaa 5280 aaatctgatc gtgtcaccca atctacagca ttccagttaa ttgattacat aaacagttat 5340 ggaatacata acatataata tgttctagca ctgtgcccag gcacttggga tatagctagt 5400 gaacaaaatt cctctatgta aaaccctaca ctgactttct cttcctcata atataaaaat 5460 aaaaatcctt aacatggctt ccatcactcc tgcctacctt tcagtgtgcc acttatctcc 5520 tcaccctctt ccatttggcc atacagattt cttttttttt ttttttgagg cggagtctcg 5580 ctctgtgaga ctgaagtgca ctcccaggct ggagtgcagt ggtgtgatct gggctcactg 5640 caacctctgc ctcctggatt caagctatcc ttttgcctca gcctcccgag taactgggat 5700 tataggcatg cgccatggtg cctgggtatt ttttggtatt tttagtagag gtggggtttc 5760 accatgttga ccaggctggt cttaaactcc tgacttcagg tgatccaccc cgcctcggcc 5820 tcccaaagtg ctgggattac agaagtgagc cattgtgccc agcagggatt tctttttatt 5880 cctttaagtt gccataggac agtagctcaa gggagtagca gggtggggac atgagcatat 5940 gtagagagaa gagagggagc tagtggagag ggaatgattt ttgagtgatg aaggtaggac 6000 tgatggaaca aagcaatgga gagaggtggc ctgggatcag aagtgcacat tgggatgtag 6060 gatgttgaag aaagagggat atttatttct tatggatagt caggaaaaga agaacagaga 6120 agagaaattt tgaggtggag aggagggaaa aggagagagc tcagtttctg caaaacagta 6180 cacaacattt tctgcttgta ggaaattgga gagggacgga gggggccagt gctggaggag 6240 tggaggatgt gcagtgaggg gtgctttagg gagtggagga gatgggccca agagatgtta 6300 ggtaaagtgt cagggtggac tgcaattgtt cagaaattgg ggtaataaaa tcagccctat 6360 taagacagac ctcccagatt gctaagaagt aatgttgttg aatgttaaag tctttttact 6420 aagtaaggac aggtatggtg tttccagggg atgcaaaaga aagattgcaa agatcataga 6480 agtccatgaa ggtgctgaga ttagctctgg gggaatattg ttagcaaaag attgctgagt 6540 agtcactttc atcccaactg ttgctgctat cacactcttt ctgcttttta gacacctgct 6600 gtccgtcggg atggataatg catcagaaaa gctgctttta catctcactt acttcaaaaa 6660 attggcagga gagccaaaaa caatgtgaaa ctctgtcttc caagctggcc acattcagtg 6720 aaatttatcc acaatcagta agcatatcct tactcacaga ctgcagtggg tactattagg 6780 taaagggaca tatgtcatca agtcaggaat ggcagtgcac acatggctgg ggtggccatt 6840 ccttttccat ggaagaaact gagaatcata atactcttga cctctgagct tggaaacaac 6900 ctctgcatct tctcagcaca gtggtccagg cagcccctag taactgatta gagttggcat 6960 gagaatggaa acttaaccat atctacattt tttttttttt gagacagagt ctcactctgt 7020 cacccaggct ggagtgcact tgcataatct cagctcactg caacctccgc ctcctgggtt 7080 catgcaattc tcccgtctca gcctcctgag tagctgggac tacagtcaca tgccaccacg 7140 cccagctaat ttttgtattt ttaatagagt cggggtttca ccatattggt caggctggtc 7200 tctaactcct ggcctcaggt gatccacctg cttcagtctc ccaaagtgct gggattatag 7260 gtgtgaacca ctgcgcccgg cctctacttt aatgtgtatc tcatctcagc ttgctggtca 7320 gctccctggg ctgcttgaca ggttcctgga ctcttgtggc ctctcaggca gagaggtccc 7380 acacctggtg ggctggattg tagggattag ttccttgggg ccagaacagc acacagaatg 7440 actcccctcc aaaggctggc agagattctt actcagctgg aaggcaaatt tagcagtgag 7500 cctgcatctg cagggtggca cttgatggat tcagggacaa gttaccaaag agctgtcata 7560 tatacagccc ccttgccaaa gctgaggcct ccctagggtg tgcattttct tccatatttc 7620 tcatcactct ccatgtatct ctggtctctc tgaatctccc ctcagcactc ttactacttc 7680 ttaaattcac tgttgccaaa tggtggttca gggaattcat attggactgg cctcagctct 7740 aacaaggatt ggaagttgac tgatgataca caacgcacta ggtaagtttg ttaggacttc 7800 tgggtatttt caactccata atcccctcca taaagacatg gtaatcagat ttccttccca 7860 cctggtcccc tcaggactct cccataaatc aacactgtaa ttgcctattt gtctgtttcc 7920 tctatttgat taataacttc ttagggcaat cttattctcc attctatttc cacctagaag 7980 tccctggcac atactttttt gtttcagtaa ctactaggtt gggacctacg ctatctttta 8040 gggggagaca ttaaggattt agttaaggct ccacagctga tgacagcagc gtttctccat 8100 ccttcgagaa ggccacccaa actgaggttg tactttctca gtagcggcca cttggggcac 8160 tagagctgcc tcagtgaagt acgcaaggga gcagcggggg taccatcgaa gctccacttc 8220 actgtgatca agacagcagt ggggtaccat tgaggatcaa ccctagggac cctgaaacag 8280 ggtggcaacc tccttatgtt cctcaattgg ccttcccatc tttttttttt ttttgagatg 8340 gagtctcgct ctgtcgccca gactggagtg cagtggtgca atctcagctc actgcaagct 8400 ccgcctcagg ggttcacacc attctcctgc ctcagcctcc ccagcagctg ggactacagg 8460 cgcacaccgc catgcccggc taattttttg tatttttagt agagacgggg tttcaccgtg 8520 ttagtcagga tggtctcgat gtcctgacct tgtgatccac ccgccttggc ctcccaaagt 8580 gctgggattg caggcatgag ccaccgtgcc cagcctggcc ttcccatctt taagatctcg 8640 agttattttt ccctctgctg gttctaaagt ctgttttagt tgaactccag acatctgtcc 8700 tgaaccatgc ccagatgtgg ccttaggcag gcagggtgag ttaggattta ggttttatac 8760 aggggaaggg gagagaaggg agggtgggat atgtccagag catatcccag ctctacctta 8820 ctacttttta ggacttatgc tcaaagctca aaatgtaaca aggtacataa aacttggtca 8880 tggtggacac tggagtcaga gtcatgtaga agttctcttc cctacatctg tgagatgaca 8940 gctttcaggt ttccagatta ggacagtcct ttgcactgag ttggtaagag tgagggatgg 9000 ggctgaggca tggggagtcc aggggcagag ggactcagag cagggactca gagcaggggc 9060 ccaggggccc gagagaaaac ttgtacctgg gatgaaagtt cctccacagc ctccagttgc 9120 ttggtctgct tgttagggtg gggtgggcag ggagggacag gtaaggccta caacttatgg 9180 gaggctggtg gtgtggctgt cgagagctga ggaggagatg gggtgagtgg gatgagggtt 9240 cagttaagtc aaccatggag ctcactgata ctttactttt ccagacactc atgccaacaa 9300 gaacctgtgc ccctccttcc taacctgaca cttccagcca gtctgctgcc tgctccctct 9360 tcctgaaact ggactgttcc tgggaaaagg gtgaagccac ctctagaagg gactttggcc 9420 tccccccaag aacttcccat ggtagaatgg ggtgggggag gagggcgcac gggctgagcg 9480 gataggggcg gcccggagcc agccaggcag ttttattgaa atctttttaa ataattgcac 9540 gtgttagtct catgtgtcag caatgctgtg tctggttcag tgat 9584 19 22 DNA primer 19 atcagggcag ataaccagaa gg 22 20 21 DNA primer 20 aaacattgtg gactccccag c 21 21 1357 DNA Mus sp. 21 tagctggaag actgtgaagc agaggcgccc agggctatgg ctgacgctat cacgtatgca 60 gacctgcgct ttgtgaaagt gcccctgaag aacagcgcat ctaaccatct aggacaggac 120 tgtgaggcct atgaagatgg ggaactcacc tacgagaacg tgcaagtgtc tccagtccca 180 ggagggccac caggcttggc ttcccctgca ctagcggaca aagcaggtcc tacagtctgc 240 ttgcaatact tcttgcttgg ccttctcgtg tcctgtctga tgttaggggt ggctgtcatc 300 tgcctgggag ttcgctatct gcaggtgtct cggcagttcc aggaggggac caggatttgg 360 gaagccacca atagcagcct gcagcagcag ctcagggaga agataagtca gctggggcag 420 aaggaggtgg agcttcagaa ggctcggaaa gagctgatct cgagccagga cacattacag 480 gagaagcaga ggactcacga ggacgctgag cagcaactac aagcctgcca ggctgagaga 540 gcgaagacca aggagaacct gaaaactgag gaggagcgga ggagggacct ggaccagagg 600 ttgacaagca cgcgggagac actgaggcgc ttcttctctg attcatcaga cacctgctgt 660 ccatgcggat ggattccata tcaggaaagg tgcttttaca tctcacatac cctcggaagt 720 ctggaggaga gccaaaaata ctgcacatct ctgtcctcca aactggcagc attcgatgaa 780 ccttctaagc attactatga atacctctct gacgcccccc aggtttctct gcccagcggc 840 ttagaggagt tgctagatcg ttcgaagtta tattggatac agatgagcaa gaagtggagg 900 caggactctg actctcaaag ccgacattgt gtcaggataa aaacatatta ccagaagtgg 960 gaaagaacaa tttccaagtg tgcagagctt cacccctgca tttgtgagtc ggaggctttc 1020 aggtttcctg atgggatcaa tctgaactga aacggacact tgaacaagac cttgtgacct 1080 acatccttaa cctacggcct gccaattttt aagactgcta ttcctccagc actccctcac 1140 tctcgggcat gcccagctaa gggatgacct gctgcttgct tgaaagctgc tccagaaact 1200 ggacttctct tgggaagagt aaagaagcct ccagaaaaga cttgaccttc cttaagaact 1260 tcccaaacta gagatgggtc aggggagggc gctccgctga gtggatggat ggcccggagc 1320 cagccaggca gttttattga aatattttta aatactt 1357 22 337 PRT Mus sp. 22 Met Ala Asp Ala Ile Thr Tyr Ala Asp Leu Arg Phe Val Lys Val Pro 1 5 10 15 Leu Lys Asn Ser Ala Ser Asn His Leu Gly Gln Asp Cys Glu Ala Tyr 20 25 30 Glu Asp Gly Glu Leu Thr Tyr Glu Asn Val Gln Val Ser Pro Val Pro 35 40 45 Gly Gly Pro Pro Gly Leu Ala Ser Pro Ala Leu Ala Asp Lys Ala Gly 50 55 60 Pro Thr Val Cys Leu Gln Tyr Phe Leu Leu Gly Leu Leu Val Ser Cys 65 70 75 80 Leu Met Leu Gly Val Ala Val Ile Cys Leu Gly Val Arg Tyr Leu Gln 85 90 95 Val Ser Arg Gln Phe Gln Glu Gly Thr Arg Ile Trp Glu Ala Thr Asn 100 105 110 Ser Ser Leu Gln Gln Gln Leu Arg Glu Lys Ile Ser Gln Leu Gly Gln 115 120 125 Lys Glu Val Glu Leu Gln Lys Ala Arg Lys Glu Leu Ile Ser Ser Gln 130 135 140 Asp Thr Leu Gln Glu Lys Gln Arg Thr His Glu Asp Ala Glu Gln Gln 145 150 155 160 Leu Gln Ala Cys Gln Ala Glu Arg Ala Lys Thr Lys Glu Asn Leu Lys 165 170 175 Thr Glu Glu Glu Arg Arg Arg Asp Leu Asp Gln Arg Leu Thr Ser Thr 180 185 190 Arg Glu Thr Leu Arg Arg Phe Phe Ser Asp Ser Ser Asp Thr Cys Cys 195 200 205 Pro Cys Gly Trp Ile Pro Tyr Gln Glu Arg Cys Phe Tyr Ile Ser His 210 215 220 Thr Leu Gly Ser Leu Glu Glu Ser Gln Lys Tyr Cys Thr Ser Leu Ser 225 230 235 240 Ser Lys Leu Ala Ala Phe Asp Glu Pro Ser Lys His Tyr Tyr Glu Tyr 245 250 255 Leu Ser Asp Ala Pro Gln Val Ser Leu Pro Ser Gly Leu Glu Glu Leu 260 265 270 Leu Asp Arg Ser Lys Leu Tyr Trp Ile Gln Met Ser Lys Lys Trp Arg 275 280 285 Gln Asp Ser Asp Ser Gln Ser Arg His Cys Val Arg Ile Lys Thr Tyr 290 295 300 Tyr Gln Lys Trp Glu Arg Thr Ile Ser Lys Cys Ala Glu Leu His Pro 305 310 315 320 Cys Ile Cys Glu Ser Glu Ala Phe Arg Phe Pro Asp Gly Ile Asn Leu 325 330 335 Asn 23 18 DNA primer 23 agaggcgccc agggctat 18 24 21 DNA primer 24 ccctcccctg acccatctct a 21 25 19 DNA primer 25 agcatatccc agctctacc 19 26 20 DNA primer 26 tgacacatga gactaacacg 20 

What is claimed is:
 1. A method of diagnosing cancer in an individual, comprising detecting a mutation in the human CD72 gene, mRNA or protein, wherein the presence of the mutation is indicative of cancer.
 2. The method of claim 1, wherein the presence of the mutation in the CD72 gene is detected by direct mutation analysis by restriction digestion.
 3. The method of claim 1, wherein the presence of the mutation in the CD72 gene or mRNA is detected by sequence analysis of the CD72 gene or mRNA.
 4. The method of claim 1, wherein the presence of the mutation in the CD72 gene or mRNA is detected by hybridization of a nucleic acid probe to the CD72 gene or mRNA in the test sample from the individual.
 5. The method of claim 1, wherein the mutation in the CD72 gene or mRNA is detected by formation of a nucleic acid duplex wherein a first strand of said duplex comprises a nucleic acid probe having a sequence complimentary to part of the gene or mRNA encompassing the mutation, and the second strand of said duplex comprises a nucleic acid sequence of the gene which is complimentary to said probe.
 6. The method of claim 1, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, wherein the presence or absence of PCR product is indicative of the mutation.
 7. The method of claim 1, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, wherein a PCR product that is larger or smaller than a PCR product predicted for the wild-type CD72 gene or mRNA is indicative of the mutation.
 8. The method of claim 1, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, wherein the presence or absence of a restriction site in the PCR product is indicative of the mutation.
 9. The method of claim 1, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, and then probing for the amplified fragment using a nucleic acid probe having a sequence complimentary to part of the CD72 gene or mRNA encompassing the mutation, or by sequencing the amplified fragment.
 10. The method of claim 1, wherein the mutation is detected by detecting a difference in the relative electrophoretic mobility of a CD72 protein from an individual possessing the mutant CD72 protein, as compared to a normal CD72 protein, which is indicative of said mutation in said CD72 gene or mRNA.
 11. The method of claim 1, wherein the mutant CD72 gene is detected by using antibodies which distinguish between mutant CD72 protein and normal CD72 protein.
 12. The method of claim 11, wherein the antibodies are directed to mutant CD72 protein.
 13. The method of claim 11, wherein the antibodies are directed to normal CD72 protein.
 14. The method of claim 11, wherein the antibodies are monoclonal antibodies.
 15. The method of claim 1, wherein the mutant CD72 gene protein is detected by immunocytochemistry wherein said method utilizes an antibody or antibodies which distinguish between the mutant CD72 gene protein and normal CD72 gene protein.
 16. The method of claim 15, wherein the antibodies are directed to mutant CD72 protein.
 17. The method of claim 15, wherein the antibodies are directed to normal CD72 protein.
 18. The method of claim 1, wherein the cancer is thyroid follicular cancer.
 19. The method of claim 18, wherein the mutation results in a deletion of the first ITIM structure of CD72, wherein the presence of the mutation is indicative of thyroid follicular carcinoma.
 20. The method of claim 18, wherein the mutation results in a CD72 mRNA sequence with a corresponding cDNA sequence of SEQ. ID. NO:5.
 21. The method of claim 18, wherein the mutation results in a mutated genomic CD72 sequence of SEQ. ID. NO:4.
 22. The method of claim 18, wherein the mutation results in a CD72 protein with the sequence of SEQ. ID. NO:6.
 23. The method of claim 18, wherein the mutation is detected by comparing the amount of binding of antibody specific for the mutant CD72 protein of SEQ. ID. NO:6, wherein the antibody does not bind to normal CD72.
 24. A method of claim 1, wherein the cancer is renal cancer.
 25. The method of claim 24, wherein the mutation results in an mRNA sequence with a corresponding cDNA sequence of SEQ. ID. NO:7, wherein the presence of the mutation is indicative of renal cancer.
 26. The method of claim 24, wherein the mutation is detected by comparing the amount of binding of antibody specific for the mutant CD72 protein of SEQ. ID. NO:8, wherein the antibody does not bind to normal CD72.
 27. The method of claim 1, wherein the cancer is acute lymphocytic leukemia.
 28. The method of claim 27, wherein the mutation results in the deletion of codon 141 to codon 190 of SEQ. ID. NO:2, wherein the presence of the mutation is indicative of acute lymphocytic leukemia.
 29. The method of claim 27, wherein the mutation results in an mRNA sequence with a corresponding cDNA sequence which contains the sequence of SEQ. ID. NO:11.
 30. The method of claim 27, wherein the mutation results in an mRNA sequence with a corresponding cDNA sequence which contains the sequence of SEQ. ID. NO:12.
 31. The method of claim 27, wherein the mutation results in an mRNA sequence with a corresponding cDNA sequence which contains the sequence of SEQ. ID. NO:13.
 32. The method of claim 27, wherein the mutation results in an mRNA sequence with a corresponding cDNA sequence which contains the sequence of SEQ. ID. NO:14.
 33. The method of claim 27, wherein the mutation results in a protein with the sequence of SEQ. ID. NO:16.
 34. The method of claim 27, wherein the mutation is detected by comparing the amount of binding of antibody specific for the mutant CD72 protein of SEQ. ID. NO:16, wherein the antibody does not bind to normal CD72.
 35. The method of claim 1, wherein the cancer is endometrial adenocarcinoma.
 36. The method of claim 36, wherein the mutation results in a CD72 mRNA with a corresponding cDNA sequence of SEQ ID NO:10.
 37. The method of claim 35, wherein the mutation results in a mutated genomic CD72 sequence of SEQ ID NO:9.
 38. The method of claim 1, wherein the cancer is chondrosarcoma.
 39. The method of claim 38, wherein the mutation results in a CD72 mRNA with a corresponding cDNA sequence of SEQ ID NO:17.
 40. The method of claim 38, wherein the mutation results in a genomic CD72 sequence of SEQ ID NO:18.
 41. The method of claim 1, wherein the cancer is ovarian cancer.
 42. The method of claim 1, wherein the cancer is lymphoma.
 43. A method of diagnosing lupus erythematosus in an individual, comprising detecting a mutation in the human CD72 gene, mRNA or protein, wherein the presence of the mutation is indicative of lupus erythematosus.
 44. The method of claim 43, wherein the presence of the mutation in the CD72 gene is detected by direct mutation analysis by restriction digestion.
 45. The method of claim 43, wherein the presence of the mutation in the CD72 gene is detected by sequence analysis of the CD72 gene or mRNA.
 46. The method of claim 43, wherein the presence of the mutation in the CD72 gene or mRNA is detected by hybridization of a nucleic acid probe to the CD72 gene or mRNA in the test sample from the individual.
 47. The method of claim 43, wherein the CD72 mutation is detected by formation of a nucleic acid duplex wherein a first strand of said duplex comprises a nucleic acid probe having a sequence complimentary to part of the gene or mRNA encompassing the mutation, and the second strand of said duplex comprises a nucleic acid sequence of the gene or mRNA which is complimentary to said probe.
 48. The method of claim 43, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, wherein the presence or absence of PCR product is indicative of the mutation.
 49. The method of claim 43, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, wherein a PCR product that is larger or smaller than a PCR product predicted for the wild-type CD72 gene or mRNA is indicative of the mutation.
 50. The method of claim 43, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, wherein the presence or absence of a restriction site in the PCR product is indicative of the mutation.
 51. The method of claim 43, wherein the mutation is detected using PCR to amplify a fragment of the CD72 gene or mRNA encompassing the mutation, and then probing for the amplified fragment using a nucleic acid probe having a sequence complimentary to part of the gene or mRNA encompassing the mutation, or by sequencing the amplified fragment.
 52. The method of claim 43, wherein the mutation is detected by detecting a difference in the relative electrophoretic mobility of a CD72 protein from an individual possessing a mutant CD72 protein as compared to a normal CD72 protein which is indicative of said mutation in said CD72 gene or mRNA.
 53. The method of claim 43, wherein the mutant CD72 gene protein is detected by immunoblotting using antibodies which distinguish between mutant CD72 gene protein and normal CD72 gene protein.
 54. The method of claim 43, wherein the mutant CD72 gene protein is detected by a monoclonal antibody, wherein said monoclonal antibody distinguishes between mutant CD72 protein and normal CD72 protein.
 55. The method of claim 43, wherein the mutant CD72 gene protein is detected by immunocytochemistry wherein said method utilizes an antibody or antibodies which distinguish between the mutant CD72 gene protein and normal CD72 gene protein.
 56. The method of claim 43, wherein the mutation is detected by comparing the amount of binding of antibody specific for the mutant CD72 protein of SEQ. ID. NO: 22, wherein the antibody does not bind to normal CD72.
 57. The method of claim 43, wherein the mutation results in an mRNA with a corresponding CD72 cDNA sequence of SEQ ID NO:21.
 58. An antibody specific for a mutant CD72 protein, wherein the antibody does not bind to normal CD72 protein.
 59. The antibody of claim 58, which antibody is a monoclonal antibody.
 60. The antibody of claim 58, wherein the mutant CD72 protein has the sequence of SEQ. ID. NO:6.
 61. The antibody of claim 58, wherein the mutant CD72 protein has the sequence of SEQ. ID. NO:8.
 62. The antibody of claim 58, wherein the mutant CD72 protein has the sequence of SEQ. ID. NO:16.
 63. The antibody of claim 58, wherein the mutant CD72 protein has the sequence of SEQ. ID. NO:22.
 64. A method of supplying wild-type CD72 gene function to a cell which has lost said gene function by virtue of a mutation in a CD72 gene, comprising: introducing a wild-type CD72 gene into a cell which has lost said gene function such that said wild-type CD72 gene is expressed in the cell.
 65. The method of claim 64, wherein the wild-type CD72 gene introduced recombines with the endogenous mutant CD72 gene present in the cell by a double recombination event to correct the CD72 gene mutation.
 66. The method of claim 64, wherein the mutation is a deletion of the first ITIM domain of said CD72 gene.
 67. The method of claim 64, wherein the mutation results in a CD72 mRNA with a sequence corresponding to the cDNA sequence of SEQ ID NO:5.
 68. The method of claim 64, wherein the mutation is a deletion of codon 141 to codon 190 of SEQ ID. NO:2.
 69. The method of claim 64, wherein the mutation results in a CD72 mRNA with a sequence corresponding to the cDNA sequence of SEQ ID NO:10.
 70. The method of claim 64, wherein the mutation results in a CD72 mRNA with a sequence corresponding to the cDNA sequence of SEQ ID NO:7.
 71. A method of supplying wild-type CD72 gene function to a cell which has a mutation in a CD72 gene, comprising: introducing a portion of the wild-type CD72 gene into a cell which has lost said gene function such that said portion is expressed in the cell, said portion comprising a part of the wild-type CD72 gene which contains the mutation in the cell.
 72. A method of supplying wild-type CD72 protein function to a cell which has lost said protein function by virtue of a mutation in a CD72 gene, comprising: introducing wild-type CD72 protein into a cell which has lost said protein function. 